GAGTATAATATGTAAACTAACAATCGAGAATGTTTAAATAATCCCAAAAA
SHI IHLIVSSYKFIRVF 
LILYI LLALTNLLGFL 
SYYTFDC.LLQiY.GF 
| Hind III

ataagagttca'agcttttg^ 
tattctcaagttcgaaaacccttaaatta^ 
irvqafgnlimvgyifk 
efkllvi .sw.vifs 
yksssfw. fnhgrlyfq 

aacttgtaacctgcattttgtctctttatttcatgcaatattcttttcct 

TTGAACCTTGGACGTAAACCAGAGAAATAAAGTACGTTATCCGAAAAGGA 

TCNLHFVSLFHA I FFS 
KLVTC I LSLYFMQYSFP 
NL PAFCLFI SCN I LFL 

TGATTGGCTT ACGTC ATT TACT TGAG^ 
ACTAACCGAATGCAGTAAATGAACTCA^ 

LIGLRHLLELAHM. LFK 
LAYVIYLS.LICNCLN 
DWLTSFT.VSSYVTV. 

TATTTGGGATTATTGGTTAACGGAT 
CTAAACCCTAATAACCAATTGCCTATTT 
YLGLLVNG. KKL I DFRY 
IWDYWLTDKKN L I LD 

IFGI IG.RIKKIN. F. I 

27 X [TA] 

ATGCT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT ATATAT AT A 
TACG A TAT A TAT AT A TAT AT AT A TAT AT A TAT AT AT AT A TAT AT AT AT AT 

NAIYIYIYIYIYIYIY 
TMLYIYIYIYIYIYIYS 
QCYIYIY1YIYIYIYIY 

T AT ATATAT AT TATAGGTAGAAACTTGGT^ 

atatatatataatatccatctttga^ 

iyiyyr.klgi ihtyvr 
yiyiigrnlv.ftrmfa 
iyil.vetwynshvcs 
TTTATCTGAATAAAATGAGT^

aaatagacttattttactcatcagg^ 

Fl IK.VVLSMQISLTP 
LSE.NE.SFQCRLVLL 
LYLNKMSSPFNAD . SYS 

ACTTGCAGATGCACGACCAATTT^ 

TGAACGTCTACGTGCTGGTTAAACGAACTAGTAGAAGGTATCTCGTGGTG 

LADARPICLI IFHRAP 
HLQMHDQFA. SSS I EHH 
TCRCTTNLLDHLP STT 



ACA NGA GTG 
| Pst I

AGCT AAGTCTCCGATGTGTTC^ 

TCGATTCAGAGGCTACACAAGATGACGTCCTCACGTTAGCTAACCACAGA 
QLSLRCVLLQECNRLVS 
S VSDVFYCRSA I DWCL 

AKSPMCSTAGVQS I GV 

GCTACGGAATGCTCGGCAACAAT 

CGATGCCTTACGAGCCGTTGTTAGAAGGGGGCGGGTCGCTCCACCAGTCA 
ATECSAT I FPRPARWSV 
LRNARQQSSPAQRGGQ 
CYGMLGNNLPPPSEVVS 

CTCTACAAATCCAACAACATCGC^ 

GAGATGTTTAGGTTGTTGTAGCGCTCCTACTCTGAGATGCTAGGTTTGGT 

STNPTTSRG.DST IQT 
SLQ I QQHREDETLRSKP 
LYKSNN I ARMRLYDPNQ 
GGCCGCCCTGCAAGCCCTCAG^

ccggcgggacgttcgggagIccttg^ 

rppckpsgtptskscwm 
grpaspqelqhpspvgc 
aalqalrnsn i qvlld 

tcccccgatccgacgtgcagtcactggcctccaatccttcggccgccggc 

AGGGGGCTAGGCTGCACGTCAGTGACCGGAGGTTAGGAAGCCGGCGGCCG 

SPDPTCSHWPP i LRPPA 
PP I RRAVTGLQSFGRR 
VPRSDVQSLASNPSAAG 

| BamHI

gactg'gatccggaggaacgtcgtcgccta^ 

CTGACCTAGGCCTCCTTGCAGC^ 

TGSGGTSSPTGPASPF 
RLDPEERRRLLAQRLLS 
DWI RRNVVAYWPSVSFR 

ATACAT AGCTGTCGGAAACGAGCTGATCCC^ 
TATGTATCGACAGCCTTTGC^ 

DT.LSETS.SPDRIWRS 
I HSCRKRADPR ! GSGAV 
Y I AVGNEL I PGSDLAQ 
ACATCCTCCCCGCCATGCG^
TGTAGGAGGGGCGGTACGCGTTGTAG^ 
TSSPPCATSTMLCPRLA 
HPPRHAQHLQCFVLGW 
Y I LPAMRN IYNALSSAG 

| Sal I 

CTGCAAAACCAGATCA AGGTCTCGACC^ 

GACGTTTTGGTCTAGTTCCAGAGCTGGC^ 

CKTRSRSRPRSTRASS 
PAKPDQGLDRGRHGRPR 
LQNQ I KVSTAVDTGVLG 

CACGjCCTACCCTCCCTCCGCCGGCGC^ 

GTGCAGGATGGGAGGGAGGCGGCCGCGG^ 

ARPTLPPPAPSPPPPRR 
HVLPSLRRRLLLRRPGV 
TSYPPSAGAFSSAAQA 

ACCTGAGCCCCATCGTGCAGTTCTT 
TGGACTCGGGGTAGCACGTCAAGA^ 

T APSCSSWRVTERRSW 
PEPHRAVLGE . RSAAP 
YLSP I VQFLASNGAPLL 

| Sma I | Bgl II

GTCAATGTGjACCCTTATTTT 

CAGTTACACATGGGAATAAAATCGATGTGGCCGTTGGGCCCTGTCTAGAG 

SMCTL I LATPATRDRS 
GQCVPLF. LHRQPGTDL 
VNVYPYFSYTGNPGQ I S 

GCTGCCCTACGCCCTGT TCACGGCC^ 

CGACGGGATGCGGGACAAGTGCCGGAGGCCGCAGCAGCACGTCCTACCCG 
RCPTPCSRPPASSCRMG 
AALRPVHGLRRRRAGWA 
LPYAL FTASGVVVQDG 

| Sal I

GATTCAGCT ATCAGAACCTGTTC^ 

CTAAGTCGATAGTCTTGGACAAGCTGCGGTAGCAGCTGCGCCAGAAGCGC 
DSA I RTCSTPSSTRSSR 
I QLSEPVRRHRRRGLR 
RXSYQNLFDA I VDAVFA 



FIG. 15D-1 
GCGCTGGAGAGAGTGGGAGGGG^
CGCGACCTCTCTCACCCTCCC^ 

RWREWEGRTWRWWCRR 
GAGESGRGERGGGGVGE 
ALERVGGANVAVVVSES 

CGGGTGGCCGTCGGCGGGCGGAGGAGCCG^ 

GCCCACCGGCAGCCGCCCGCCTCCTCGGCTTCGCTCGTGGTCGTTGCGCG 
AGGRRRAEEPKRAPATR 
RVAVGGRRSRSEHQQRA 
GWPSAGGGAEASTSNA 

AGACGTACAACCAGAACTTGATCAGGCATG 
TCTGCATGTTGGTCTTGAACTAGTC^ 
RRTT RT . SGMLAEERRG 
DVQP ELDQACWRRNAE 
QTYNQNL I RHVGGGTPR 

AGACCAGGGAAGGAGATCGAGGCATACA^ 

TCTGGTCCCTTCCTCTAGCTCCGTATGTATAAGCTCTACAAGTTGCTCTT 

DQGRRSRHTYSRCSTR 
ETREGDRG I H I RDVQRE 
RPGKE I EAY I FEMFNEN 

CCAGAAGGCTGGAGGGATCG^ 

GGTCTTCCGACCTCCCTAGCTCGTCTTGAAACCGGACAAAATAGGGTTGT 
TRRLEGSSRTLACF I PT 
PEGWRDRAE LWPV L SQQ 
CKAGG i EQNFGLFYPN 

| Hind III

AGCAGCCCGTATACCAAATAA^ 

TCGTCGGGCATATGGTTTATTCGAAAATCTTTGATTGAACATTCCAACTA 
SSPYTK.AFRN. LVRLM 
AARIPNKLLETNL.G. 
KQPVYQ I SF KLTCKVD 

5 X [CTAC] 
GAATCATCTCCTACCTACCT^ 

CTTAGTAGAGGATGGATGGATGGATGGATGCTTATTTTGTACTTTATTTC 
NHLLPTYLPTNKT NK 
I ISYLPTYLRIKHEIK 
ESSPTYLPTYE NMK. S 



FIG. 15D-2 



Appln. Filing Date: Herewith 

TITLE: DNA REGULATORY ELEMENTS ASSOCIATED 
WITH FRUIT DEVELOPMENT
Inventor(s): Gregory D. May et al.
Application serial No: CIP OF 09/160,351 SHEET 28 of 94 



| EcoR I | cDNA ENDS (POLY A)

CACCAAAAT AAAGGGAGAATCT TGATC 
GTGGTTTTATTTCC<UcTTAGAACTA^ 

APK RENSDLGES I M M 

HQNKGRIL1LEKVES. 
TN I KGEF SWRKLNHD 

AT ATAT AACAAACACCCCTCTTTACT 
TATATATTGTTTGTGGGGAGAAATGAGTAA^ 
IYNKHPSLL I I SMLQVS 
Y I TNTPLYSLSVCYKF 
Dl QTPLFT HYQYVTSF 

TTGAAACTTGAACGGATCACAATTTGGACCTACAAGTATTTTGGGTCATA 
AACTTTGAACTTGCCTAGTGTTAAACCTGGATGTTCATAAAACCCAGTAT 

NLNGSQFGPTS I LGH 
LET TDHNLDLQVFWV i 

LKLER I TXWTYKYFGS 

ATTATTTCATTGAACTATA^ 
TAATAAAGTAACTTGATATATAAGTTT^ 

NYF I ELY I QKKMCLECL 
IISLNYIFKKRCVWSA. 
LFH T I YSKKDVFGVL 

ATACAGTATGACTTCAGTTTGCAAGATTACCTCTTCAGCGTCAGCTTCAG 

TATGTCATACTGAAGTCAAACGTTCTAATGGAGAAGTCGCAGTCGAAGTC 
I QYD-FSLQDYLFSVSFS 
YSMTSVCK I TSSASAS 
NTV LQFARLPLQRQLQ 

CATGCCAAAAAACCATCATCTGC^ 
GTACGGTTTTTTGGTAGTAGACGA^ 

MPKNH HLLWGMFYT LM 
ACQKTI ICYGACFTP.W 
HAKKPSSAMGHVLHLDG 



FIG. 15E-1 
TGCTACATCATCATCATTCATGTT^

ACGATGTAGTAGTAGTAAGTACAAAGTAAAATCCAGAGCACGAGAAATAT 
VLHHHHSCF I LGLVLF I 
CY! I I IHVSF.VSCSLY 
ATSSSSMFHFRSRALY 

TAGATCACATAAAAGTTTGGATCGCTT^ 
ATCTAGTGTATTTTCAAACCTA^ 

i T KFGSLQVSRLHCM 
RSHKSLDRFKFLGY I V 
IDHIKVWIASSF.VTLY 

GCAGCACTTTGAGCCTACTGAACATTGTGACTGCCTTTTAGAACATTGGA 

CGTCGTGAAACTCGGATGACTTGTAACACTGACGGAAAATCTTGTAACCT 

QHFEPTEHCDCL LEHW 
CSTLSLLNIVTAF.NIG 
AAL .AY.TL. LPFRTLD 

| Pst I

CTGCAGGAA 

I i ■ i I i i i ■ 3559 

GACGTCCTT 

TAG 
L Q E 
C R K 
| Sal I

agcgagg'tcgactaatgagctacta^ 
tcgctccagAtgattact^ 
sevd. atn i nvtdsnr 

ARSTNELLTLMSQ I V i 

qrgrlmsy.h.chr. 

atgagaagccgtatccaacacgcaatctgt^ 

tactcttcggcataggttgtgcgtI^ 

eavsntqsv7lvtgl 
dekpyptrnl7twsqdf 
mrsriqha ic7lghrts 

ttatccaaagactcgcctctgcgatttcccacattcacctcatttggtcc 
aataggtttctgagcggagacgctaaagggtgtaagtggagtaaaccagg 

liqrlasaishihliws 
lskdsplrfptftsfgp 
ypktrlcdfphsphlv 

| Hind III

ataggaagcttcacagcgggc^ 

tatccttcgaagtgtcgcccgtccttaggtaaagagatatattcgtggtg 
igsftagrnpflyistt 
easqragihfsi . a p 
hrklhsgqes i slykhh 

ctcccacccacaccaccacca^ 
gagggtgggtgtggtggtggtgatggtg^ 

shphhhhyhc.gg.rp 
ppthtttt ttakedegl 
lpptppplpllrrmkal 

gttgttggtcatctttaccctgg^ 
caacaaccagtagaaatgggaccgga^ 

ccwsslpwprrsapsps 
vvghlypglvarrlrra 
llv i ftlasslgafae 



FIG. 16A-1 
AATGCGGAAGGCAAGCCGGGGGGG^

TTACGCCTTCCGTTCGGCCCCCCCGAGAGACGGGGCCGCCCGACACGACA 
NAEGKPGGLSAPAGCAV 
MRKASRGGSLPRRAVL 
QCGRQAGGALCPGGLCC 

| BamHI

AGCCAGTACGGCTGGTGCGGTAACA^ 
TCGGTCATGCCGACCACGCCATTGTG^ 

ASTAGAVTR I HTAAKD 
PVRLVR. HGS I LRPRM 
SQYGWCGNTDPYCGQGC 
CCAGAGCCAATGCGGCGGTAGCG^ 

GGTCTCGGTTACGCCGCCATCGCCGCCATCGCCGCCACCGTCGCACCGGA 
ARANAAVAAVAAVAAWP 
PEPMRR . RR. RRWQRGL 
QSQCGGSGGSGGGSVA 

CGATCATCAGCTCCTCCCT 

GCTAGTAGTCGAGGAGGGAGAAGCTCGTCTACGACTTCGTAGCGTTGCTG 
RSSAPPSSSRC S I ATT 

DHQL LPLRADAEASQR 
S I I SSSLFEQMLKHRND 

GCAGCCTGCCCCGGCAAGGGTTTCTACA^ 

CGTCGGACGGGGCCGTTCCCAAAGATGTGCATGTTGCGGAAGTAGCGGCG 

QPAPARVSTRTTPSSP 
RSLPRQGFLHVQRLHRR 
AACPGKGFYTYNAF I AA 

CGCCAACTCCTTCAGCGGGTTCG 

GCGGTTGAGGAAGTCGCCCAAGCCCTGCTGGCCGCTGCTGGGTTCTTCTT 
PPTPSAGSGRPATTQEE 
RQL LQRVRDDRRRPKK? 
ANSFSGFGTTGDDPRR 
NAAGGAGATCGCGGCTTT^
NTTCCTCTAGCGCCGAAAGAACC^ 
?GDRGFLGA?VSR?DR 
KEIAAFLA?TSH?TTG 
?RRSRLSWR?RLT?RQV 

ATTCNCACATCTCCCGAAGCTCGTAAAC 
TAAGNGTGTAGAGGGCTTCGAGC^ 

F?HLPKLVNCLWD?KL 
NSHISRSS.TVYGI7N. 
I 7TSPEARKLFMG? KTE 

ATGTTTGGGGTTTGGCAGGT^ 
TACAAACCCCAAACCGTCCAC^ 

NVWGLAGG7ATRPMVRT 
MFGVWQVG7RRARWSVR 
CLGFGRWVGDAPDGPY 

CCTTGGGTTACTGCTTCGTCCAANAACAAAACCCTCATCGGANTACTGCG 
GGAACCCAATGACGAAGCAGGTTNTTGTTTTGGGAGTAGCCTNATGACGC 

PWVTASS7NKTL IG7LR 
LGLLLRP7TKPSS7YC 
ALGYCFVQ7QNPHR7TA 
| Pst I

TCCCANCTCCCANTGQC^ 
AGGGTNGAGGGTNACCGGCACGCG^ 

P?S?WPCAAAKNTTAE 
VP? P7GRALQQK I LRPK 
P?LP?AVRCSKKYYGRS 

CCCN|CCAAATTTCATNGTNAGCC^ 

gggnaggtttaaagtancantcggtntaagantgtcaagnagcggcgcta 
a?pnf?vs? i ltv7rrd 
p?qis??a?f?qf?aai 
pskfh??p?s?ssspr 

cgagttcacaacgatgccntttctaacgc 
gctcaagtgttgctacggnaaagaItgcg^ 

rvhnda7snat i r c v ? r 
efttmpfltqqsdv7c 
sssqrc7f . rnnpmc7a 

tgcagcaantacaantacgggccg^ 
acgtcgttnatgttnatgcccggc^ 
aa7t7tgrpgepsv7t 

VQQ7Q7RAGRESHRF7? 
CS7Y7YGPAGRA I GSD? 

GNTCAACAACCCAGACCTGGTGGCCACNGACGCGACCATCTCNTTCAAGA 

CNAGTTGTTGGGTCTGGACCACCGGTGNCTGCGCTGGTAGAGNAAGTTCT 

7STTQTWWP7TRPS7SR 
7QQPRPGGH7RDHL7QD 
7NNPDLVATDAT I SFK 

CGGN|CTGTGGTTTTGGATGACTCNT 
GCCNAGACACCAAAACCTACTGAGNAG 
R7CGFG . L7SRPSR7AT 
RSVVLDDSSVAQAVVP 
T7LWFWMT7QSPKP7CH 
GACGTGAT A ACCGGGAGCTGG^
CTGCACTATTGGCCCTCGACCT^ 

T PGAGRHPTPTRRP 
RRDNRELDA I QRRPGGR 

DV I TGSWTPSNADQAAG 
AAGGCTTCCGGGCTACGGTGTCACCACC^ 
TTCCGAAGGCCCGATGCCACAGT 

EGFRATVSPPTSSMEGW 
KASGLRCHHQHHQWRVG 
RLPGYGVTTNI INGGL 

AGTGCGGGAAAGGGTA^ 

TCACGCCCTTTCCCATGCTACGGTCCCA^ 

SAGKGTMPGWR I GSAST 
VRERVRCQGGG DRLL 
ECGKGYDARVADR I GFY 

AAGAGGT ACTGCGACT TGCTGGGGG^ 

TTCTCCATGACGCTGAACGACCCCCACTCGATGCCTCTGTTGAACCTGAC 

RGTATCWG ATETTWT 
QEVLRLAGGELRRQLGL 
KRYCDLLGVSYGDNLDC 

CTACAACCAGAGACCCT 

GATGTTGGTCTCTGGGAAACGAAGATGTCGTCGATGTCGGTGTAAGATCG 
ATTRDRLLLQQLQPHSS 
LQPETLCFYSSYSH I LA 
YNQRPFASTAATATF 

GGTGAGCTA|GGAGACAAC^ 

CCACTCGATACCTCTGTTGAACCTCACGATGTTGGTCTCTGGGAAATGAA 

GELWRQLGVLQPET I YL 
VSYGDNLECYNQRPFT 
R . AMETTWSATTRDPLL 



FIG. 16B-2 






agtccgatactactgtgacgaat 
tcaggctatgatgacactgcttaggt 

vryycdesm .rnkry 
sdttvtnpcnna i na i 
spill. rihvitq.tll 

actgagatagcgactccgtgagttgactgt 
IgacIctatcgctgaggcactcaac^ 

y dsdsvs l klrrks 

TE i ATP . VDCRSCGGSL 
LR.RLRELTVEVAEEV 

| Hind III

tcaataaa'agcttanctacatacatgg^ 
agttattttcgaatngatgtat 
s i ka? lhtwptt i vdrd 
q. kl?y i hgpqlsltv 
fnksl7tymahnyr. p. 

tcatatgcatccatcaaatgtcctcaaat^ 
agtaIacgtaggtagtttacaggagttta 

hmhpsnvlkclgvskc 
iic1hqmssnvle.vna 
syas i kcpqmswsk . mr 
TATTCGATCGGTAAAATGAAGATGTTAGAATAAATAAAATTAATTATTTT

ATAAGCTAGCCATTTTACTTCTACAATCTTATTTATTTTAATTAATAAAA 
VFDR NEDVRI !MK I NYF 

YSIGKMKMLE. 1KLI IF 
IRSVK.RC.NK.N.LF 

TTTATAATTATAAATATTTTAATATATTTTTT 
AAATATTAATATTTATAAAATTATATA 

Fl I INILIYFLILKILK 
L.L. IF.YIF.S.RS. 
FYNYKYFN 1 FFNLKDPK 
AACCCAATTATAAGGATTTTA^ 

TTGGGTTAATATTCCTAAAATATATACC^ 

I.L.GFYIWIGILRIF 
KSNYKDF I YGLGY . EYL 
NLI IRILYMDWDTKNI. 

| Bgl II

attataaaaattaatatactttttaatc^ 
taatatttttaattatatgaaaaaItagaa^ 
nykn . ytf s . rsnyky 
i 1kinilfnlkdli isi 

L.KLIYFLILKI.L.V 
TTTCTATATGGATTGGGATATTAACT 

XXXXXTXTXXXTXXXXXTXTXXTTXXXXTXXXTXXXTXTTTTTXXXXTTX 

FLYGLGY.LDLLIK1LI 
FYMDWDINSIYL.KF. 
FS IWIGI LTRFTYKNFN 

ATAAAAATTTTAAATTTAAAAATTAAA^ 

TXTTTTTXXXXTTTXXXTTTTTXXTTTTXTXXTTTTTXTXXXTTTXTXTT 

KF. I KLKY.KYLNI 
YKNFKFKN NTKN I I 

IKI LNLKIKI LK ISKYN 
CGGTAATCATGAGATCGAGAACGTGATGATTGAGATCATGAGATCGAGGT

gccatt agt actct agctcttgcactact aactct agt actct agctcca 
tvimrsrt. lrs drg 
r . s drerdd. dhe i ev 
gnhe i envmi e imrsr 

tgagagtaaaaaggaaatta^ 

actctcatttttcctttaatgcaaIta 

e kgnyvnhgkfrfvc 
eskke i tl imgnfvlf 

LRVKRKLR.SWE I SFCL 
CACGGTCGAGATGGTGACCGTGGACACCT 

GTGCCAGCTCTACCACTGGCACCTGTGGATTGTAGGTGTTGGCCGTACGT 

TVEMVTVDT HPQPAC 
ARSRW. PWTPN i HNRHA 

HGRDGDRGHLTSTTGMQ 
ATAACCATGTTGTCATATGTTAGCTT 

TATTGGTACAACAGTATACAATCGAACAGAGTATAGAATACTGGTACTTA 
NNHVV I C . LVSYLMTMN 
ITMLSYVSLSHIL.P.I 
PCCHMLACL I SYDH 

CACAjAGTCTTCACGAATAT^ 

GTGTATCAGAAGTGCTTATAATTAATTCGGTCGAATCGTAGTGTCAAAAC 

HIVFTN1N.ASLASQFC 
T.SSRILIKPA.HHSF 
SHSLHEY. LSQLS I TVL 

CACCTTTGTACCATANCTGAAGTGTTCG^ 

GTGGAAACATGGTATNGACTTCACAAGCATACCGAACTGGGTAGGGCTCA 

TFVP7LKCSYGLTHPE 
APLYH? SVRMA. P I PS 

HLCT I 7EVFVWLDPSRV 
GTATGGTCTCCCGGANCCTGGAG^
CATACCAGAGGGCCTNGGACC^ 

CMVSR? LERVNPRSS G 
VWSPG7WSVLTRGLVEG 
YGLP7PGAC.PEV.LR 

CGTATCTGGAACAANAGAATCCGTCTCC^ 

A. TL?S .AEVEDHSFSY 
HRPC7LRQRLK1TPLA 
G I DLV7LGRG . RSLL L 
jCCGT TGGGTGCCT AT AT A^ 

AGGCAACCCACGGATATATTTCCAGcH 

PLGAY I KVE IMRG I ?N 
IRWVPI RSKS GGF7T 

SVGCLYKGRNHEGDS L 
CGACCTATTCAATATTTGA^ 
GCTGGATAAGTTATAAACTCGATCGTTC^ 

STYS I FELARVGVTCMR 
RPIQYLS.QELELRV.G 
DLFNI .ASKSWSYVYE 

TTCGACCCCCAATGCTGTTCCTGGGGTC^ 
AAGCTGGGGGTTACGACAAGGACCCC^ 

FDPQCCSWGRFYTYSCM 
STPNAVPGVAF I P I PA 
VRPPMLFLGSLLYLFLH 
GTGAjCATACATAGTAGCTTTAATCATCTT 
CACTAGTATGTATCATCGAAATTAGTAGA^ 

SYIVALI IFSHHRTL 
CDHT. .L.SSSVI IVRW 
VI I HSSFNHLQSSSYVG 
GTGCATGCATTGTCTAATTTACTCGA^
CACGTACGTAACAGATTAAATGAGCTA 

GACIV.FTRFN7VRHCF 
VHALSNLLDS7SFDTAS 
CMHCLIYSIQ7RSTLL 

| Xho I

CTACCTACTATGTGGCCCAATACATA^ 
GATGGATGATACACCGGGTTATGTAT^ 
LPTMWPNT LYCL I RPR 

YLLCGPIHSCIVSYGL 
PTYYVAQY I VVLSHTAS 

AGCAAAGCGTGTGCAGA^ 
TCGTTTCGCACACGTCTCCTT 

AKRVQRNCVKWLAGLG 
EQSVCRGTVSSGWLASG 

SKACAEELCQVVGWPRA 
TCATGGCATTGAGTTGGCTCGATACAAC^ 
AGTACCGTAACTCAACCGAGCTATGTT 

LMAL SWLDTTHRLRDTM 
SWH.VGSIQHIGLGIPC 
HG I ELARYNTSA GYH 

GGCTCAGTTAACACCATCAACTGTA 
PSLLW. LTCHVGWMPKY 
RVYCGS HVMWGGCQN 
AES I VVVDMSCGVDAK I 
|GCTATATCATTCTCTCCCTACAAAGGAG 
ACGATATAGTAAGAGAGGGATGTT^ 

AISFSPYKGVVP.ENR 
MLYHSLPTKELCHRRIV 
CYIILSLQRSCAIGESW 
CTGTGCCGAACCCAAGACACCA

GHGLGSVVGPCSPQLGG 
DTAWVLWSVLVRLSWVD 
TRLGFCGRSLFASVGW 

yy^?yy9^y?^9yy99??^y?y9yy999y999?^^9y^?^9yy99y^9 

AATGAAGTAGTTCAACCGGNAGACAACCGAC^ 
i-LHQVG? LLAGQSTLGR 
YF I KLA7CWLGKVHLV 
I TSSSWPSVGWAKYTW. 

?9^y99y?9^9^?^9^99^99^t99yy999y^t9^?yy99yyyy99^9 

cctaccagctctgttcnggttc^ 

DGRDK7KEGWLRLGFR 
GMVET7PRKVG DLVFD 
GWSRQ7QGRLAKTWFST 

AATCAATTGTTTATGAGGCGAATGGTAT^ 

ttagttaacaaatactccgcttaccatag^ 

Q S I VYEANG I PPLGCLL 
NQLFMRRMVSLRWGVCS 
I NCL GEWYPSVGVSA 

?yyy?9^yyy9yy9^9^y99^yy9yyy9yy9y^99^99?yy99yy99^yy 

CAAAGCTAAACAACGCTACCTAACAAACAAC 
VS I CCDGLFVVGGLVRL 
FRFVAMDCLL EAWFD 
RFDLLRWI VCCRRLGS I 

GCTCTTAAGTCGGGAGAAGGTATTT^ 
CGAGAATTCAXCCCTCTTCCATAAACNAT 

LLSREKVF7KEFNLTM 
CS.VGRRYL7RSSI .PC 
ALKSGEG I 7 GVQFDHV 
jGAAGTGAATAAAAGGACT

ACTTCACTTATTTTCCTGAACGGTTCTTCAAACCGAGCTGGCACAATTTC 
LK. I KGLAKKFGSTVLK 
SE KDLPRSLARPC S 

EVNKRTCQEVWLDRVK 

CCAGAG A ATGTGT ATGTCGAGGTCTATTCA ACC ATGTGGAAGCT AGAGAA 

GGTCTCTTACACATACAGCTCCAGATAAGTTGGTATACCTTCGATCTCTT 
PENVYVEVYSTMWKLEN 
QRMCMSRSIQPCGS.R 
ARECVCRGLFNHVEARE 

jGCACCAATTGTGAGGTTTGGCTTGCT 

ACGTGGTTAACACTCCAAACCGAACGATTGCAAATTTCGTCTTCCTATAT 

AP I VRFGLLTFKAEGY 
MHQL.GLACSRLKQKDI 
CTNCEVWLAHV. SRRIY 

CTTGCTACGAGGTTTGCT^ 

GAACGATGCTCCAAACGAGTTGGTACACCTTCGTTAGTTTACGTGAACGA 
TCYEVCSTMWKQSNALA 
LATRFAQPCGSNQMHLL 
LLRGLLNHVEA I KCTC 
ATGAGGTTTGGCTTGACTT ACTCGACAATGGACGCTNGTAAGTGAGAAGG
TACTCCAAACCGAACTGAATGAGCTGTTACCTGCGANCATTCACTCTTCC 
MRFGLTYSTMDA7K . EG 
. GLA LTRQWTLVSEK 
YEVWLDLLDNGR? . VRR 

| Spe I

gactanccaagacttagttggcaagga'ct 

AtGATNGGTtCTGaJItCA^ 

T7QDLVGKD S I LARQ 

GL7KT LARTSRYL LDN 

D7PRLSWQGLVDTCST I 

| Sal I 

agatgcct at aggt aatggattgactgag^ 
IctacggatatccaItacctaactga 

mp igngltet . stkts 

RCL VMD LRLSRQRLA 

DAYR.WID.DLVDKD. 

| Xho I

TGAGACTTAGTGGGCAATGGATGCCTATAA 
ACTCTGAATCACCCGTTACCTA^ 

DLVGNGCL . VRKDGSR 
ET .WAMDAYK ERMAR 
LRLSGQWMP I SKKGWLE 

ATTAATAAAGATCAAATAATTAATATA^ 

taattatttAtagtttattaattatat^ 

likik.li . iyqtlng 
d. rsnn.ykfikhlmd 

INKDQi ININLSNT.WT 

gcatataagtgagaaaggacggatcgagat 
cgtatattcactctItcctgccta^ 

ri.vrkdgsrlikik.l 
ayk.ertdrd. r s n n . 

H I SEKGRI E INKDQI I 
ATATAAGTTTATCAAACN^

TATATTCAAATAGTTTGNGAATAATTNTG^ 
I VYQTL I 7TLDKRGTM 

Y K F I K? LL7HWTKEVL 
N I S L S N 7 Y . 7 IGQKRYY 

Q^AATATTAAAAT TCpCpGAGGCACAAATAT TAT TTCCpAAATACpTT T TCpTCCp 
CATTATAATTTTAACCCTCCGTGT 

Y . NWEAQI L.FPNTFL 
CN I K I GRHKYYFQ I LFS 
VILKLGGTNI ISKYFSP 

AATTCGGGAAGCGGTGGTAACGG^ 

LKPFATIAILIYFFYII 
LSPSPPLPF.SIFSI.L 
ALRHHCHFNLFFLYN 

ATCNCATAACATTCGTACATGAGATAT 
TAGNGTATTGTAAGCATGTACTC^ 
' ? HSYMRYDINLRPAL 
S H N I RT . DMT TFDLL 
Y 7 1 T F V H E I .HKPSTCF 

AGTAAACATNTTGA|TATNGTGACACC^ 
TCATTTGTANAACTAATANCACTGTGG^ 

VN?LI?VTPEAIILLT 
T?.L?.HQKP.YCLP 
SKH?DY?DTRSHN I AYL 

|AACATGATGGAGA|GAAC|TTAGTTGGTCC^ 
ATTGTACTACCTCTACTTGAAATCAACCA 

LT.WR.TLVGP7I.7ME 
HDGDEL . LVQ?SN?WK 
NMMEMNFSWS7YL 7NG 
GTGGACAAGCACGATGACTAGGAT^
CACCTGTTCGTGCTACTGATCCTACC^ 
VDKHDD DGYMFMC LS 

WTSTMTRMATCSCVDF 
SGQAR . LGWLHVHVLTF 

CAAGTAATCAATCAAGCTGGAATCGAA^ 

GTTCATTAGTTAGTTCGACCTTAGCTTATTCTGCTAATTTCATCCCGCTA 

K.SIKLESNKTIKVGR 
PSNQSSWNR I RRLK GD 
QV I NQAG I E DD SRAM 

GACCATTAAGTTCAATGTCACGCTCATC^ 

CTGGTAATTCAAGTTACAGTGCGAGTAGTTGTATTAAGGTTGTGGCACGT 
PLSSMSRSST . FQHRA 
DH VQCHAHQHNSNTVQ 
TIKFNVTLINI IPTPC 

| Bgl II

GAAAGATCTTATCTTACATTGACT 

CTTTCTAGAATAGAATGTAACTGAACGGGTAGGCCGGCGGCCGTAGCTAA 
ERSYLTLTCPSGRRHRL 
KDL I LH LAHPAAG I D 

RKILSYIDLPIRPPASI 
| EcoR I

GGCGGAAACGAAGGGTCAGTCTC^ 

CCGCCTTTGCTTCCCAGTCAGAG 

AETKGQSPNSHSKDEF 
WRKRRVSLP I H IQRTNS 
GGNEGSVSQFTFKGRIH 

TTTTCATCAGATGAGCACTTCAGTCCT 
AAAAGTAGTCTACTCGTGAAGTCAGGAC^ 

IFIR.ALQSCL1IFYYY 
FSSDEHFSPA LYF I I I 

FHQMSTSVLLDY I LLL 

TATTATTATTAATTGAATGGTAAGTTT 

ataaIaataattaacttaccattcaaatgt^ 
yyy. lngkfteyidilv 

i i in.mvslqni . if. 
llll 1 ew. vyriyryfs 

ttcaat aaaat att ttaaaaaatgat 
aagttattttataaaatttIttactatttcc^ 

SIKYFKK. .REKVDLI 

fq. n 1 lkndkgrrwi s 
fnk i f kmi kgeggfdl 

taggatttttattgtgagcaataaaag^ 
atcctaaaaataacactcgK 

LGFLL . A ! K V F S . N F Q N 
DFYCEQ. KSLVRTSKM 
RIFIVSNKSL.LELPK 

GTGTCAAATGAACCCTAATAAGTGGGTT 

cacagtttacttgggattaItcacccaa^ 

VSNEP. .VGLVYGYDEI 
CQMN PNKWVWSMVTMR 
CVK.TLISGFGLWLR.D 
CAGTATTTGTATATAAAAA^
GTCATAAACATATATTTTTTTAATAG^ 

S 1 C I KNYQLDFYFLT 
SVFVYKKI INLIFIF.P 
QYLYI KKLST. FLFFNP 

|TAATAAGTGGACA|GATATATCATAAT 
AATTATTCACCTGTACTATATAGTATT^ 

LNKWT.Yl I IKSCDV. 

l i sghd i s snhvm7de 
vdmiyhnqim.c7m 

gtnataacatattttttaataatnaaaattat 

cantattgtataaaaaattattanIttta^ 
vi tyfl i ?ki 7nrekir 
7 . h i f . ? k l ? [ e k k . 

S?NI FFNN7KY? .RKNK 

ATTACTATCCCTTCTATNGATGTNTTAT^ 

TAATGATAGGGAAGATANciACAN^ 

LLSLL7M7YN I L I PF? 
DYYPFY7C7I IF.SLSI 
I T I PS7DVL . YFNPF7Y 

TAGATTCACGTAGAATAAGAAAGATTATAATCGCATCAAATCAAATACAG 

ATCTAAGTGCATCTTATTCTTTCTAATATTAGCGTAGTTTAGTTTATGTC 

IDSRRIRKI I IASNQIQ 
IHVE.ERL.SHQIKYR 
RFT NKKDYNRI KSNT 

AATNAAATCATGCTTTTGACTTAATTCGAAAAATAATCTTCCTCTCTTGA 
TTANTTTAGTACGAAAACTGAATTAAGCTTTTTATTAGAAGGAGAGAACT 

N? IMLLT FEK. SSSLD 

7KSCF LNSKNNLPLL 
E7NHAFDLIRKI IFLS. 
TAATATCCTTATTGATAAG^

attataggaataacIattcgtaanaatata 

nilidkh?yiyiy?yq 
i isllisi?iyiy?yin 
ypy. a7lyiyi ? 1st 

ttctaaaanatatttttaaattaattaaa 
aagattttntataaaaaHtaattaattt^ 
llk? ifkl1kfikikr. 
f.7ifln.lnlsk.kdk 
sk7yf. in.1yqnkk! 

actaaattagttctgcatcataatgta^ 
tgatttaatcaagacgtagtattacatca 
tklvlhhnvvsvrtce i 
ln.fci im. .v.elvk 
n. issas.cskcknl.n 

| Xba I | Spe I

anggat'ctagaacactgatagaaaattccaaaccatta'ctagttctactt 
tncctagatcttgtgactatcttttaaggtttggtaatgatcaagatgaa 

? i ntdrkfqt i tsst 
7gsrtl i enskpllvll 
9dleh..kipnhy.fyl 
GATGAAAACAAAACCATATAAAAGAATCCTCTTATATATATATATATATA
CTACTTTTGTTTTGGTATATTTTCTTAGGAGAATATATATATATATATAT 

KQNHIKESSYIY1YI 
DENKTi K N P L I Y I Y I Y 

MKTKPYKRILLYIYIY 

TATACTACTTTACTTATTCTTTGGACGTA^ 
ATATGATGAAATGAATAAGAAACCTGCATG^ 
YTTLL I LWTYNTSQETE 
I LLYLFFGRTTQVRKP 
1 YYFTYSLDVQHKSGNR 

AACAAAGGTGGCGGAAAGTTGGCAGA^ 

ttgtttccaccgcctttcaaccg^ 

tkvaeswq7lkrlfve 
kqrwrkvgr? rdfs . k 
nkgggkla7aeetfrrs 

tgaaggagacacacgtctataagaattgt 
acttcctctgtgtgcagatattctI^ 

vkethvyknchdytlkk 
rrhtsirivmtir.rk 
egdtrl els. lyaee 

aagaggggagagagagagaaggaagcgcc^ 
ItctcccctctctcIctctIc^ 
krgerekeaplltglvh 

RGERERRKRHC pvls 

kegreregsatvdrscp 

| Sal I | Sal I

tgaggaattgtttg'tcgactaatgagca^ 

actccttaacaaacagctx^ 

eelfvd. .AVQTFVST 
mrnclstneqykhlcrq 
g i vcrlmsstn i cvdr 
ATGGCAACAAATGAGAAGCGG^
TACCGTTGTTTACTCTTCGCCAT^ 

DGNK EAVSQHA i CSLW 

matnekrypntqsvafg 
wqqmrsgiptrnl.pl 

TCNCCAGACjTATCCAAAGACTTGCCT 
AGNGGTCTGAATAGGTTTCTGAACGGA 
SPDLSKDLPLRFPHAPH 
7QTYPKTCLCDFLMRL 
V?RL I QRLASA I SSCAS 

| Hind III 

tctgjtccaaaggaagctt^ 

agacaaggtttccttcgaagtgtcgcccgtccttaggtaaagagatatat 

lfqrklhsgqes i sly 
i cskgsftagrnpfly i 
svpkeasqrag i hfs i 

agcaccacctcccacccacaccaccac^ 
tcgtggtggagggtgggtgtggtg^ 

khhlpptppppppllrr 
sttshphhhhhhhc gg 
apppthtttttttake 

atgaaggccttgttgctggtcatttttaccctgg 
tacttccggtacaacgaccagtaaaaa^ 

MKALLLV I FTLASSLGA 
RPCCWSFLPWPRRSA 
D EGLVAGHFYPGLVARR 
CTTCGCCGAGCAATGCGGAAGGCAAGCCGGG 

GAAGCGGCTCGTTACGCCTTCCGTTCGGC^ 

FAEQCGRQAGGALCPG 
PSPSNAEGKPGGLSAPA 
LRRAMRKASRGGS L PRR 
ggctgtgctgtagccagtacggctggtgcg^
ccgacacgacatcggtcatgccgaccac^ 

glccsqygwcgntdp7c 
gcavastagavtr i h ? a 
a v l pvrlvr. hgs7l 

ggtcaaggatgccananccaatgcnc 
ccagItcctacggtntnggttacg^ 
gqgc??qc??stpspst 

vkda7 7 na7aprpplp 
rsrmp7pm77lhalpfh 

tccgagcggcggtggcanngttggctcgatca 
aggctcgccgccaccgtnncaac^ 

PSGGG7VGS I I iSSLF 
LRAAVA7LARSSSPPSS 
SERRW77WLDHHLLPL? 

AGCAGATGCTGAAGCATCNCANCGA^ 

TCGTCTACGACTTCGTAGNGTTG^ 

7QMLKH77D7A7PG7GF 
SRC. S I 77TQPAPA7AS 
ADAEAS7R7S7PRQ7L 

FIG. 16G-3 



TACNCGTNCACCGCCTTCATCTCCGC^ 
ATGNGCANGTGGCGGAAGTAGAGG^ 
Y ? ? T A F I SAA7S F7GFG 
TR?PPSSPPP?PS?GS 
L?VHRLHLRR?LL?RVR 

GACNACCNGCGACCACTCCACNAATAANANGGANATCNCGGCTTTCTTGG 

CTGNTGGNCGCTGGTGAGGTGNTTATTNTNCCTNTAGNGCCGAAAGAACC 

TT7DHSTN77? I ? A F L 
G7PATTP? I ???SRLSW 
D??RPLH? . ?G??GFLG 

TNCNGACNTCTCNCGAGACNACANGTA^ 

ANGNCTGNAGAGNGCTCTGNTGTNCATTAGGNANGNAGAGGGCTCCGAGC 
V?TS?ETT?NP??SRGS 
???L?R??VI??SPEAR 
?D?SRD?? S??LPRL 

TCTNCAGNTTATNGATAGACANCTN^ 
AGANGTCNAATANCTATCTGTNGAN^ 
S??Y?.T??CIG?GTWV 
LQ??DR? LNALG7ARG 
V??L?ID??MHWV?HVG 

GGTCCACCGTGCCCNATGGCCNTTC^ 
CCAGGTGGCACGGGNTACCG^ 

VHRA7WPFAWGYCFVQ 
WSTVP?G?SRGVTASS? 
GPPCPMA7RVGL L LRP? 

AACAGAACCCTCATCGGACTACTGCGTCGCCAGCTCGCANTGGCCGTGCG 
TTGTCTTGGGAGTAGCCTGATGACGCAGCGGTCGAGCGTNACCGGCACGC 

?QNPHRTTASPAR?GRA 
NRTL IGLLRRQLA7AVR 
TEPSSDYCVASS7WPC 
CTGCANGCAANAAATACTACGGCCGAAGCCCCATCCAAATCTCATTCAAC
GACGTNCGTTNTTTATGATGCCGGCTTCGGGGTAGGTTTAGAGTAAGTTG 
L?A?NTTAEAPSKSHST 
C?Q? I LRPKPHPNL IQ 
AA??KYYGRSPIQISFN 

T ACAACT ACGGGCCGGCCGGGAAAACCATCGGCTCCGACCTGCTCAACAA 

ATGTTGATGCCCGGCCGGCCCTTTTGGTAGCCGAGGCTGGACGAGTTGTT 

TTTGRPGKPSAPTCST 
LQLRAGRENHRLRPAQQ 
YNYGPAGKT IGSDLLNN 

CCCAGACCTGGTGGCCACCGACCCGACCATCTCCTTCAAGACGGCTCTGT 
GGGTCTGGACCACCGGTGGCTGGGCTGGTAGAGGAAGTTCTGCCGAGACA 

TQTWWPPTRPSPSRRLC 
PRPGGHRPDHLLQDGSV 
PDLVATDPT I SFKTAL 

GGTTCTGGATGACTCCTCAGTCGCCCAAGCCGTCGTGCCACGACGTGATA 

CCAAGACCTACTGAGGAGTCAGCGGGTTCGGCAGCACGGTGCTGCACTAT 

GSG. LLSRPSRRATT. 

VLD'DSSVAQAVVPRRD 

WFWMTPQSPKPSCHDVi 

ACCGGGAGCTGGACGCCATCCAACGCCGACCGGGCGGCCGGAAGGCTTCC 

tggccctcgacctgcggtaggttgcggctggcccgccggccttccgaagg 

pgagrhptptgrpegf 
nrelda i qrrpggrkas 
tgswtpsnadraagrlp 

gggct acggtgtcaccaccaacatcatc^ 
cccgatgccacagtggtggItgtagtagtIa 
ratvspptssmegwsag 
glrchhqhhqwrvgvre 
gygvttni ingglecg 
AAGGGTCCGATGCCAGGGTGGCGGATAGG^

ttcccaggcIacggtcccaccgcc^ 
kgpmpgwr i gsast7gt 
rvrcqggg drllq7v 
kgsdarvadr i gfy7ry 

TGCGACTTGCTGGGGGTGAGCTACGGAG^ 

acgctgaacgacccccactcgatgcct^ 

atcwg atettwtatt 
lrlaggelrrqlgllqp 
cdl lgvsygdnldcyn? 

NAGTCCCTTTACTTANTCCGATACTA^ 
NTCAGGGAAATGAATNAGGCTATGATA^ 

?VPLL?RI LCANPCNNA 
?SLYL?RYYVRI HV I TQ 
SPFT * SDTMCESM . R 

ATAAACGCTACTGCTGAAATAGCGACT 

tattIgcgatgacgactttatcgctgaggc^ 
i natae i atp vdcrsc 
tlllk.rlrelivev 
nkryc . nsdsvs l kl 

| POLY A 

CGGAGGAAATCTTCAATAAAAGCTAAG^ 
GCCTCCTTTAGAAGTTATTTTCGATT^ 

GGNLQ.KLS.TSSWPS 
AEE I FNKS AEQVHGPQ 
RRKSS I KAKLNKFMALN 

TCATCGTTGATCGTCGTCAGATGCATC^ 

AGTAGCAACTAGCAGCAGTCTACGTAGGTAGTTTACAGAACCTCANTCAN 
I I VDRRQMHPSNVLE7V 
SSL I VVRC I HQMSWS7? 
HR.SSSDASIKCLGVS 
AATGCGTTTTCNATCGGTAAATTGAAGATGTTAGAATAAATAAAATTATT
TTACGCAAAAGNTAGCCATTTAACTTCTACAATCTTATTTATTTTAATAA 

NA7SIGKLKMLE. IKLF 
MR??SVN.RC.NK.NY 
?CVF?R. IEDVRINKI I 

TATTTTTTATAATTATAAATATTTTAAT 
ATAAAAAATATTAATATTTATAAAATT^ 

I FYNYKYFN I FFNLKD 
LFFI I INILIYFLILKI 
YFL.L. IF.YIF.S.RS 

CTAAAAAATCTNATTATAAGGATTTTATA^ 

gattItttagantaatattcctaaaa^ 

pkks7ykdfiyglgy. 7 
lknli !rilymdwdt?k 

Kl ?L .GFYIWIGI L? 

| BamH I

aanttnattatnaaaattaatatacttttaatcttaaggatcctaaaaaa 

ttnaantaatanttttaattatatgaaaattagaattcctaggatttttt 
?? i ?k1ni ll i lri lkk 
77l7kliyf.s.gs.k 
k??y?n .ytfnlkdpkk 

acataattataaggattttc^ 
IgtaItaatattcctaaaagatatacc^ 

HNYKDFLYG7GY . Q?? 
NIIIRIFYMD7DTN77. 

t.l.gfsiw7gilt77n 
ttgtaaaaatttnaatataaaattgttaaatctaaaaattaaaatactaa 

aacatttttaaanttatattttaacaatttagatttttaattttatgatt 
ivki7i.nc.i.klky. 
l . kf7yki vkskn . ntk 
ckn7nikllnlkikil 
| EcoR V | Bgl II
AAATATATANTAATCATGA^ 

tttaIatatnattagtactatagcIct^ 
ky i 7 imi srmwrldle i 
ni7.s.yrecga.isr 
k i y7nhd i envalrsrd 

cgaggt tgag act an agnggaa att at^ t 
gctccaactctgatntcncctttaata 

evet??eiml imgnfl 
srlrl??klc. swe i ff 
rg.d??gnyvnhgkfsf 

TGTT|CCAAGACGA|GACCGTGGAAACCT 

acaaaggttctgctactggcacctttggattgtaggcgttagccagtacg 
lfprr pwkpn i rnrsc 
cfqdddrgimltsa i gha 
vsktmtvet . hpqsvm 

aataaccatgttatcatcantgaact 
ttattggtacaatagtagtnacttg^ 
nnhvi i7elvvvilrpq 
i tmlss7nlssssygh 
q. pcyh? tcrrhltat 

aatcacagtcttctancaaggcacgaat 
ttagtgtcagaagatngttccgtgctt^ 

i tvf7qgtn ! nesnvv 
ksqss7kari lmspt . y 

NHSLL7RHEY. .VQRSi 

C TATA TTGTTTTACATTT TAT ACCGTANTCGAGGTGTTCGCACGATTTTG 

GATATAACAAAATGTAAAATATGGCATNAGCTCCACAAGCGTGCTAAAAC 

S I LFYTF I P7SRCSHDL 
LYCFTLLYR7RGVRT I W 
Y I VLHFYTV7EVFARF 
GCCCATCCCAAGTGCAT AAGATC^
CGGGTAGGGTTCACGTATTCTAGTAA^ 
AHPKC I RSL I . PLRWSV 
PIPSA.DH.YDLYVGA 
GPSQVHKI I DMTSTLER 

| Bgl II

gttaacccga'gatctagttgagggggca^ 

caattgggctctagatcaactcccccgtatccagagtaaanggatgcacc 

ltrdlvega . vsf7yv 
c. pe i lrghrsh7stw 
vnprss.ggigl i ?lrg 

aggt|aaagatcacctttattncancc^ 

tccaatttctagtggaaataangtngggaacatctaagatttganctcca 
evkdhly??pcrf t?g 
rlk i tf i ??lvdsklev 
g.rspl??pl.iln?r 

ngatctctntaggagatcggtctcccttggaactctntaggggtncc 

I i i i I i i i i I i i i i I i i i i I i i i i I i i i i I i i i i 1 i i i i I i i i i I i i — 739 
NCTAGAGANATCCTCTAGCCAGAGGGAACCTTGAGANATCCCCANGG 

?SL.EIGLPWNS?GVP 
DL7RRSVSLGTL G? 
? 1 S?GDRSPLEL?RG? 
| BamH I

GGATCCCAACTTTTAGGAATGGATCTTAAAATTTTAGTTATAAGTTCAAA 

CCTAGGGTTGAAAATCCTTACCTAGAATTTTAAAATCAATATTCAAGTTT 

GSQLLGMDLK I LV I SSK 
DPNF.EWILKF.L.VQ 
R I PTFRNGSKNFSYKFK 

GTTAGAAAAATCTTTACCAAGAGCTTTGAGTCCATTGATGACATCCGTGA 

CAATCTTTTTAGAAATGGTTCTCGAAACTCAGGTAACTACTGTAGGCACT 

LEKSLPRALSPLMTSV 
S.KNLYQEL.VH. .HP. 
VRKI FTKSFES IDDIRE 

AACGGTGT ACATGTCTCCG^ 
TTGCCACATGTACAGAGGCTA^ 

KRCTCLRWTHLVSFGKV 
NGVHVSDGLTWFHSEKF 
TVYMSPMDSLGF I RKS 

CGAAAGAGTGCAT AAGAATATTGATTTTGGATTCTT TCACTCGGTTGGTG 

GCTTTCTCACGTATTCTTATAACTAAAACCTAAGAAAGTGAGCCAACCAC 

RKSA EY FWI LSLGWC 

ERVHKN I DFGFFHSVG 
SKEC IRI L I LDSFTRLV 

CCTTCATGAGTGACCTCAAGAGTCCTCCA^ 
GGAAGTACTCACTGGAGTT^ 

LHE PQESSKYQKPNH 
AFMSDLKSPPN I KSRI T 
PS.VTSRVLQISKAESQ 

| EcoR I

aattgaaatgtgattg'aattcattttt 
ttaactttacactaacttaagta^ 

klkcd. ihfclmhktgh 
n.nviefifv.ctkqgi 

IEM.LNSFLSNAQNRA 
TCATAGCCTTTGTG jTTAAAGCAA^
AGTATCGGAAACACAAATTTCGTTTTT^ 

S.PLCLKQKHSSPIHPI 
HSLCV.SKNiLLRFIP 
I AFVFKAKTFFSDSSH 
TCGCjCATCGGAAGAGAAAATTTTTGAAATC 
AGCGAGTAGCCTTCTCTTTTAAAAAC^ 

RSSEEKIFEIHFRQ.T 
FAHRKRKFLKS I FDNRP 
SL I GRENF NPFST I DQ 



AAGCTCGAAATCCA|GGAAATGAGGAAGAT^ 
TTCGAGCTTTAGGTACCTTTACTCCTTCTAGGAGTATACTCAAAAGGTTA 
KARNPWK GRSSYEFSN 
KLE I HGNEEDPHMSFP I 
SSKSMXEMRKf L I VFQ 

ACATGTAATjCGACTCATTAAA^ 

TGTACATTAAGCTGATGAATTTGTATCCACCTACACATTACTTTACTGGG 
TCNSTH.T.VDV. .NDP 
HV I RL I KHRWMCNEMT 
YM . FDSLN I GG.CVMK P 

TCATGCSCTATCTCTCTTGGGTATTAA 

AGTACGSGATAGAGAGAACCCATAATTTGGTTTATACTCTCACTCGGAAC 

HALSLL G I KPNMRVSL 
LM7YLSWVLNQ I E . A L 

SC? I SLGY. TKYESEPC 

CTCTGATACCAATTGTTAGGATCAGAGTG^ 

GAGACTATGGTTAACAATCCTAGTCTCACCGTGATTCTCTCCCCCCCTCT 
AL I P I VRI RVALREGGS 
L YQLLGSEWH . ERG GV 

SDTNC DQSGTKRGGE 



j Nco I 
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GAATTAGTGCAGTGGATTA^ 
CTTAATCACGTCACCTAATT^ 
ELVQWI KTYKFKNEFVN 
N.CSGLKLISLKMNS. 
ISAVD.NL.V.K. IRK 

jACGAGAAGATTTCGTTTTAATAGTAAC^ 

atgctcttctaaagcaaaattatcaH 

trrfrfnsnlsr.kpk 

I REDFVL 1 VT VDENQK 
YEKISF...LE.MKTSS 
TTAACAGTAGTGTAAATAACAATTTCGGGA^ 
AATTGTAATCACATTTATTGTTAAAGC^ 

VNSSVNNNFGKVRTHTF 
LTVV. ITISGK.ELTHS 
Q.CK.QFRESKNSHI 
AAGGAACATACCAAjTTAAAGTGGTTCGGT 
TTCCTTGTATGGTTAAATTTCACCA^ 
KEHTNLKWFGQNDLHPL 
R N I P I .SGSVKMTYIH 
QGTYQFKVVRSK PTST 



> EcoR I 
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TGTGAAGCC|TCTTCGAAGAGGCTC 
ACACTTCGGAAGAAGCTTCTCCGA 

VKPSSKRLPTSTSKSL 
L SLLRRGSQLPLANHF 
CEAFFEEAPNFH.QITL 

GAAGGGGAAGGACAAATACCTCTCTTAC^ 

CTTCCCCTTCCTGTTTATGGAGAG^ 

RGRTNTSLTTFYNGSY 
EGEGQ I PLL7PFTMVHT 
KGKDKYLSY7LLQWF I 

TCTTACAAATTTTCAACGAGAAAGAAGGA^ 
AGAATGTTTAAAAGTTGCTC^ 
SYKFSTRKKEVNMQA I E 
LTNFQRERRR. TCKQL 
LLQI FNEKEGGEHASN. 

AAACAAGACjTGCTAAAG^ 

TTTGTTCTGAACGATTTCTGAAACGXTTCCGAAAAAAAGAGTTAGATAAC 

NKTC.RLC.GFFSQSI 
KTRLAKDFAKAFFLNLL 
KQDLLRTLLRLFFS IYC 

CTTCjCAAAAGTTGTATTCTCTGC^ 

GAAGAGTTTTCAACATAAGAGACGACTCTTAACTCCCCATAAATATCTGG 
ASQKLYSLLRI EGYL T 
LLKSCI LC.ELRGIYRP 
FSKVVFSAEN.GVFID 
CCAAGAGGATTTAAATTTGGGCTCCAA^ 

GGTTCTCCTAAATTTAAACCCGAGGTTTAAAGCTTACGAGAACCCAAGGG 
PRGFKFGLQI SNALGFP 
QEDLNLGSKFRMLLGS 
PKRI IWAPNFECSWVP 
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GAGGjTGCCGGTGCCACCG^ 
CTCCAACGGCCACGGTGGCGGA^ 

RLPVPPPVSV HWTVY 
RGCRCHRLSVFDTGQCT 
EVAGATACQCLTLDSVL 

TCCCCACGGCGGCGGCCTGGAGAGCCCA^ 

RCHRRTSRVLGGATA 
SGATAGPLGCWAVPPPR 
AVPPPDLS GVGRCHRL 

TGAAAAAGTCGAGTGACCAACCTAAGGTT^ 

TFSAHWLDSKLDPNQSE 
LFQLTGWI PNLTQTSP 
DFFSSLVGFQT . PKPVR 
ACTCGGGTCCAATTGACCCGTAACCGGM 
TGAGCCCAGGTTAACTGGGCATTGGCCTA^ 

LGSN.PVTGL.D.PLI 
NSGSIDP.PDYRINP.S 
TRVQLTRNRI IGLTLNP 

|AACCCTAA|TATA|GCAAACTACGCAACT^ 
ATTGGGATTAATATACGTTTGATGCGTT 

LTLI ICKLRN.KYSPKQ 
P L YANYATEN I VLSK 

NPNYMQTTQLK I . S . A 

CAAAAATTGGCCGTTTGCAGC^ 
VFNRQTSSLLPA I FRQT 
FLTGKRRVFFRRSFGR 
SF PANVESSSGDLSAD 



FIG. 17B-2 



Appln. Filing Date: Herewith 

Title: DNA REGULATORY ELEMENTS ASSOCIATED
WITH FRUIT DEVELOPMENT
Inventor(s): Gregory D. MAY et al.
Application serial No: CIP of 09/160,351 SHEET 62 of 94 



TTCTGATATACCTTTGGATTTCT 
AAGACTATAAGGAAACCTAAAGAAGATC^ 

SDIPLDFF.RTPSRVP 
LL IYLWI SSSGLLVGSR 
F YTFGFLLADS G P D 

TCTTGTGGCGAGTTT AGCGAGT AGCCG^ 
AGAACACCGCTCAAATCGCTCATCGGCT^ 

I LWRV RVAEPSR SP Q 

SCGEFSE PNLLGDLRK 
LVASLASSRTFSV I SA 

ACCGCCGATGATCTCTTCGGCAGACT 
TGGCGGCTACTAGAGAAGCCGTC^ 
TADDLFGRLSKTSTSPR 
PPM I SSADFRKLRQVP 
NRR SLRQTFENFDKSP 

AT TTCTTCTCGGTTGGTTCCGACAGCATCTCT AACGAAACTTCGGACACC 

TAAAGAAGAGCCAACCAAGGCTGTCGTAGAGATTGCTTTGAAGCCTGTGG 

FLLGWFRQHL RNFGL 
D FFSVGSDS I SNETSDS 
I SSRXVPTASLTKLRTP 

TTGAATGTCCATCGAACTTGACTC^ 

AACTTACAGGTAGCTTGAACTGAGGCCAT^ 

LECPSNLTPVGLLY I FR 
LiMVHRT. LR.ACF I FSG 
MS I ELDSGRLALYFQ 

CTATCATAGTTAATCCTACATACTTAACT^ 

gatagtatcaattaggatgtatgaattg^ 
ls.lilht.lnnmd.in 
yhs.syilnsi iwirl 

Al IVNPTYLTQ.YGLD. 
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TTAACCCATCAATTGATTTCATCATCAAAATTCGACATTCAACAAACATC 
AATTGGGTAGTTAACTAAAGTAGTAGTTTTAAGCTGTAAGTTGTTTGTAG 

P I N FHHQNSTFNKH 
INPSIDFI IKIRHSTNI 
LTHQL I SSSKFD IQQTS 

CGT ACTCAATAACCCATCA^ 

GCATGAGTTATTGGGTAGTCCGATATCAATGCACTGATAGATGACACTAG 
PYSITHQAIVT.LSTVi 
RTQ.PIRL.LRDYLL.S 
VLNNPSGYSYVT I YCD 

CGTACGTGAAGTTAGCGAGTCATGATCCAGGTCGTGTCACTTATTGGCCG 

GCATGCACTTCAATCGCTCAGTACTAGGTCCAGCACAGTGAATAACCGGC 

RT.S.RVMIQVVSLIGR 
VREVSES SRSCHLLA 
PYVKLASHDPGRVTYWP 

AACACGTATCCCTTATCCAAATCCAGTCTTCTCAACTCTTCTAGCCTACC 

TTGTACATAGGGAATAGGTTTAGGTCAGAAGAGTTGAGAAGATCGGATGG 

TR I PYPNPVFSTLLAY 
EHVSL IQIQSSQLF PT 
NTYPLSKSSLLNSSSLP 

EcoR I

cgtctctttttttattacttttgaaag'a^ 
gcagagaaaaaaataatgaaaactttct^ 

pslfllllkefkskqiq 
rlffyyf knsnqnryk 
vsff i tferiqi ktdt 

aataacacggtgagacactgtgacatgct^ 
ItatIgtgccactctgtgacactgtacg^ 
nntvrhcdmlvsgkh . f 
itr.dtvtc.slesin 
k . hgetl haslwkal i 
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CGCGCATCCACAGACGTCGTCAGCTTCATCACCCACTTTTTCCTACATAA 

GCGCGTAGGTGTCTGCAGCAGTCGAAGTAGTGGGTGAAAAAGGATGTATT 

AHPQTSSASSPTFSY I 
SR I HRRRQLHHPLFPT 
RASTDVVSF I THFFLHN 

Hind III

ccatgtcgcatggctttgttg ^tgI acagaccaccaca'agcttgcctttgg 

ggtacagcgtaccgaaacaactactgtctggtggtgttcgaacggaaacc 

tmshgfvddrppqaclw 
pcrmallmtdhhklafg 
hvawlc. qtttslpl 

ttgtgcct aacagagagagagagacag^ 

aacacggattgtctctctctctctgtctggctatcggaggagtaagtgat 
lcltererqtdsll ihy 

CA . QRERDRP i ASSFT 

vvpnreretdr pphsl 

[T3gcgatccgatcgccagcttcgctgctgttatttgcgttcctg|atg|ctt 

accgctaggctagcggtcgaagcgacgacaataaacgcaaggactacgaa 

gdpiasfaavicvpda 
ma i rspaslllfaflml 
wrsdrqlrccylrs cl 

Pst I

gcgctcacgggaagactgca'ggcccggcgcagctcatgcattggcgtcta 
cgcgagtgcccttctgacgtccgggccgcgtcgagtacgtaaccgcagat 
cahgktagpaqlmhwrl 
altgrlqarrssc i gvy 
rsredcrpgaahalas 

Hind III

CTGGGGACAAAACACCGACGAGGGA'AGC^ 

GACCCCTGTTTTGTGGCTGCTCCCTTCGAATCGTCTACGAACACGGTGTC 
LGTKHRRGKLSRCLCHR 
WGQNTDEGSLADACAT 
TGDKTPTREA . QMLVPQ 
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GCAACTACGAATACGTGAACATCGCCACCCTTTTCAAGTTTGGCATGGGC 

CGTTGATGCTTATGCACTTGTAGCGGTGGGAAAAGTTCAAACCGTACCCG 

QLR I REHRHPFQVWHG 
GNYEYVN I ATLFKFGMG 
ATTNT . TSPPFSSLAWA 

CAAACTCCAGAGATCAACCTCGCCGGCCACTGTGACCCTCGGAACAACGG 


GTTTGAGGTCTCTAGTTGGAGCGGCCGGTGACACTGGGAGCCTTGTTGCC 
PNSRDQPRRPL PSEQR 
QTPE I NLAGHCDPRNNG 
DLQRSTSPATVTLGTT 

CTGCGCGCGCTTGAGCAGCGAAATCCAGTCCTGCCAGGAGCGTGGCGTCA 
GACGCGCGCGAACTCGTCGCTTTAGGTCAGGACGGTCCTCGCACCGCAGT 

LRALEQRNPVLPGAWRQ 
CARLSSE I QSCQERGV 
AARA . AAKSSPARSVAS 

AGGTGATGCTCTCCATCGG^ 

TCCACTACGAGAGGTAGCCTCCACCGCCCAGAATACCGGACTCAAGGTGG 

GDALHRRWRVLWPEFH 
KVMLS 1GGGGSYGLSST 
R.CSPSEVAGLMA.VPP 
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GAAGACGCCAAGGACGTAGCGTCATACCTCT^ 
CTTCTGCGGTTCCTGCATC^ 

RRRQGRSV I PLAQF LGW 
EDAKDVASYLWHSF LGG 
KTPRT RHTSGTVSWV 
Xho I

ttctgctgctcgctac'tcga^ 
aagacgacgagcgatgagctgtgg^ 
fccslletprgcgsgwh 
saarysrplgdavldg 
vlllatrdpsgmrfwma 

tagacttcaacatcgccggagggagca^ 

atctgaagttgtagcggcctccctcgtgtcttgtgatactacttgaacgg 

rlqhrrehrtl . tcr 
i dfn i agstehydelaa 
tstspeaqntmmnlpl 

gctttcctcaaggcctaca^ 

cgaaaggagttccggatgttgctcgtcctccggccttgcttctttcaagt 
fpqglqraggrneeess 
flkayneqeagtkkkvh 
ssrpttsrrperrrkf 

cttgagtgctcgtccgcagtgtcctttc^ 

gaactcacgagcaggcgtcacaggaaagggcctaatgaccgaaccgttgc 
lecssavsfpgllawqr 
lsarpqcpfpdywlgn 
t.vlvrsvlsritglat 

jBgl II 

CACTCAGAACAGATCTCTTCGACT^ 

GTGAGTCTTGTCTAGAGAAGCTGAAGCACA 

TQNRSLRLRVGAVLQQ 
ALRTDLFDFVWVQFFNN 
HSEQ I SSTSCGCSSSTT 
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CCTTCGTGCCATTTCTCCCAGAAC^ 

GGA AGC ACGGT AAAGAGGGT CT TGCGA^ TACGCAAGTT 
PFVPFLPERYQSCKCVQ 
PSCHFSQNA I NLANAFN 
LRA I SPRTLS I LQMRS 

CAATTGGGTCATGTCCATCCCTGCGCAAAAGCTGTTCCTTGGGCTTCCTG 

GTTAACCCAGTACAGGTAGGGACGCGTTTTCGACAAGGAACCCGAAGGAC 

QLGHVHPCAKAVPWASC 
NWVMS I PAQKLFLGLP 
T I GSCPSLRKSCSLGFL 

CTGCTC'CTGAGGCTGCTCCAACTGGTGGCTACATTCCACCCCATGATCTC 

GACGAGGACTCCGACGAGGTTGACCACCGATGTAAGGTGGGGTACTAGAG 

CS . GCSNWWLHSTP . S 
AAPEAAPTGGY I PPHDL 
LLLRLLQLVATFHPM I S 

ATATCTAAAGTTCTTCCGATCCTA^ 

TATAGATTTCAAGAAGGCT^ 

HI SSSDPKGFRQVRRN 
ISKVLPI LKDSDKYAGI 
YLKFFRS Rl P T S T Q E 

CATGCTGTGGACTAGATACCACGACA^ 

gtacgacacctgatctatggtgctgtch 
havd. i prqklrlqfss 
mlwtryhdrnsgyssq 
sccgldtttetpatvlk 

tcaagtcccacgtgtgtccagcgc^ 

agttcagggIgcacacaggtc^ 

qvprvssasvlqhl i y 
vkshvcparrfsn i lsm 
ssptcvqrvgsptsylc 
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CCGGTGAAGTCTTCCAAGTAAACCTG^ 
GGCCACTTCAGAAGGTTCAT^ 

AGEVFQVNLN GVDDRWS 
PVKSSK. T. TA.MIGGR 
R. SLPSKPERRR SVV 

AAAACTCCGATCATCATGGGTCCCCAT^ 

Itttgaggctagtagtacccaggggtag^ 
ktpi imgphpypcvatl 

KLRSSWVP I Rl RALLR 
ENSDHHGSPSVSVRCYV 

ATGGTGTTTCCCTTGTATGTTGGTCTTTTCAATAATA 
TACCACAAAGGGAACATACAACCAGA^ 

WCFPCMLVFSI I . G V 

YGVSLVCWSFQ. YNKGL 
MVFPLYVGLFNN I I R G . 

GTTTTACGTTTCCATATTTTCCA^ 

CAAAATGCAAAGGTATAAAAGGTACAAGCTTTTGTCATATAAACGACGGG 
SFTFPYFPCSKTVYLLP 
VLRFH I FHVRKQY I CCP 
FYVS I FSMFENS I F A A 



FIG. 17D-3 
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CTTCCAAATTTGAAAAAGATAAAATAAATATATAACTAAAAATATCCTCT 
GAAGGTTTAAACTTTTTCTATTTTATTTATATATTGATTTTTATAGGAGA 

LPNLKKIK. IYN.KYPL 
FQI.KR.NKYITKNIL 
PSKFEKDKINI L K I S S 

TTTTTTTTTCTTTCGACAAATATATAA 
AAAAAAAAAGAAAGCTGTTTATATAH 

FFFFRQIYNS.LSQLF 
FFFSFDKY I TLNFPNCL 
FFFLSTNI .LLTFPIV. 

AGCAAAAGAT ATAAATCCTCTTCCACACAAAAGACGAATCCATGATTGCT 
TCGTTTTCTATATTTAGGAGAAGGTGTGTTTTCTGCTTAGGTACTAACGA 

KQKI . I LFHTKDESM I A 
SKRYKSSSTQKTNP . LL 
AKD I NPLPHKRR I HDC 

GGATTGCTGTCTACTGGTGCCGAAAT 

cctaacgacagatgaccacgg^ 
gllstgaematreacat 
dccl lvpkwrreklvl 
wiavywcrngderslcy 

ctgcaattacaagttcgtcaacattgtct 
gacgttaatgttcaagcagttgtaa^ 

cnykfvn i vflamfgd 
pa i tssstlsslpclvt 
lqlqvrqhclpchvw. r 

ccatactcccgtgatcaggacacacctc^ 
ggtatgagggcactagtcctgtgtgg^ 

a i lp sghtsgtvswev 
pysrdqdtpleqflgkl 
htpv ! rthlwnsflgs 
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AATCTTCTTCTCGGCTCCTCGGCGACCAATCTTGTGAGGTTCTTCTCCTG 

TTAGAAGAAGAGCCGAGGAGCCGCTGGTTAGAACACTCCAAGAAGAGGAC 

NLLLGSSATNLVRFFS 
IFFSAPRRPIL.GSSP 
SSSRLLGDQSCEVLLL 

AATGGTGTCCACTTCGACATCGAAGGT^ 
TTACCACAGATGAAGCTGTAGCTT^ 

MVSTSTSKVYLSA7PQ 
EWCPLRHRRST A? I HS 

NGVHFD I EGLPER7STV 

TCCGACTACGTGTGGGTGCAGTTCTACTA^ 
AGGCTGATGCACACCCACGTCAAG 

FRLRVGAVLLHRQLADA 
SDYVWVQFYYTGNSQMP 
PTTCGCSSTTQATRRC 

CGGTAACAATGGGTTCTCCATCCTGCAT 

GCCATTGTTACCCAAGAGGTAGGACGTACCTTCCACAAGGGACCTGAAGG 
R QWVLHPAWKVFPGLP 
GNNGFS I LHGRCSLDF 
PVTMGSPSCMEGVPWTS 

Sac I Spe I

TGCTGCTCCTCAGGCTGCTGGA^ 
ACGACGAGGAGTCCGACGACC^ 

AAPQAAGRSS I PLV I L 
LLLLRLLEGAPFH . S Y 

CCSSGCWKELHSTSDLT 

ACGTGTCTTATCATCAAGAATTATAGCAAGTACCGAGGGATTATTAAAAT 


TGCACAGAATAGTAGTTCTTAATATCGTTCATGGCTCCCTAATAATTTTA 
HVSYHQEL . QVPRDY N 
TCL1 IKNYSKYRGI IKI 
RVLSSRI IASTEGLLK 
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AAAAAAAAAGGGAAGAATGGGAATTAG^ 

Ittttttttcccttcttaccctta 
kkkgkngn . n n nrp 
kkkgrmg i r ! ktetgh 
kkkreewelelklkpam 

AAGAACGTTTTCGAGTGAAGACAACGACAGTATGAGACGGTAGTTTGCTA 
TTCTTGCAAAAGCTCACTTCTGTTGCTGTCATACTCTGCCATCAAACGAT 

RTFRVKTNDSMRR FA 
EERFE . RQTTV. DGSLL 
KNVSSEDKRQYETVVCY 

TGGACATGGATCGTTCCCAAAGCAGTCCAAGTCTTTATGAACCGGTCTAT 
ACCTGTACCTAGCAAGGGTTTCGTCAGGTTCAGAAATACTTGGCCAGATA 

MDMDRSQSSPSLYEPVY 
WTWI VPKAVQVFMNRS I 
GHGSFPKQSKSL TGL 

CGGTTCAGCCTTCAAGAACCGCGAGGATAACCGGCCCAAGAGAAACAACA 
GCCAAGTCGGAAGTTCTTGGCGCTCCTATTGGCCGGGTTCTCTTTGTTGT 

RFSLQEPRG PAQEKQQ 
GSAFKNREDNRPKRNN 
SVQPSRTAR I TGPRETT 

FIG. 17E-3 
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aattgtggtgag 
ttaaAaccaAtcgaaaa™ 

i vvsf7ykpngavrqm 
klw. af? i nrtvpsvrc 
ncgell? . tercrpsdv 

j Bgl II 

TAAATGGACGGCGGATAGATCTCCAGAG^ 
ATTTACCTGCCGCCTATCTAGA 

LNGRR I DLQSKSEENRS 
MDGG. ISRVNLRKIVP 
KWTADRSPE I G K S F 

GGCCCCCCTACCACGACCCACGCG^ 
CCGGGGGGATGGTGCTGGGTGCGCTAGGC^ 
GPPTTTHA I RPLPHPLH 
APLPRPTRSVLSPTPY 
RPPYHDPRDPSSPPPPT 

EcoR I j 

CCTTTTTCTTCTTCCGCTCCTGCG^ 
GGAAAAAGAAGAAGGCGAGGACGCTAGCC^ 

LFLLPLLRSV I FCV. 
TFFFFRSCDRLFDFVYD 
PFSSSAPAIGYLILCMI 

ATCCAATTTCTTTTCTGGAGTGGTATCCT^ 

IaggItaaagaaaagacctcaccaIaggataa 
ypisflewypilis. iv 
iqflfwsgi lf flrll 
snffsgvvsysnfldc 

gtattgaaccatcagttttggtttaagc^ 

cctaacttggtagtcaaaaccaaattcgcgtactaccgcctctcaaagcc 
vlnhqfwfkrmmaesfg 
y.tisfglsa.wrrvs 
ciepsvlv.ahdggefr 
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AATCTTCTTCTCGGCTCCTCGGCGACCAATCTTGTGAGGTTCTTCTCCTG 

TTAGAAGAAGAGCCGAGGAGCCGCTGGTTAGAACACTCCAAGAAGAGGAC 

NLLLGSSATNLVRFFS. 
1FFSAPRRPIL.GSSP 
SSSRLLGDQSCEVLLL 

AATGGTGTCCACTTCGACATCGAAGGTCTACCT 
TTACCACAGATGAAGCTGTAGCTTCCAGAT^ 

MVSTSTSKVYLSA7PQ 
EWCPLRHRRST . A? I HS 
NGVHFD I EGLPER7STV 

TCCGACTACGTGTGGGTGCAGTTCTACT 

AGGCTGATGCACACCCACGTCAAGATGATGTGTCCGTTGAGCGTCTACGG 
FRLRVGAVLLHRQLADA 
SDYVWVQFYYTGNSQMP 
PTTCGCSSTTQATRRC 

CGGTAACAATGGGTTCTCCATCCTGCAT^ 

GCCATTGTTACCCAAGAGGTAGGACGTACCTTCCACAAGGGACCTGAAGG 
R.QWVLHPAWKVFPGLP 
GNNGFS I LHGRCSLDF 
PVTMGSPSCMEGVP WTS 



TGCTGCTCCTCAGGCTGCTGGAAGGAGCT^ 
ACGACGAGGAGTCCGACGAC^ 

AAPQAAGRSS I PLV I L 
LLLLRLLEGAPFH .SY 
CCSSGCWKELHSTSDLT 

ACGTGTCTTATCATCAAGAATTATAGC^ 

TGCACAGAATAGTAGTTCTTAATATCGTTCATGGCTCCCTAATAATTTTA 
HVSYHQEL .QVPRDY N 
TCLI IKNYSKYRGI IKI 
RVLSSRI IASTEGLLK 



Sac I



Spe I
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CTCTCCCGACCATTAGGATGAGGGTTGAA^ 
GAGAGGGCTGGTAATCCTACTC^ 

SPDH.DEG.R.KYFLV 
ALPT I RMRVEGENTFW. 
LSRPLG.GLKVKI LSGN 

TTTTCCTCTCTAAATTCTTCC^ 
AAAAGGAGAGATTTAAGAAGGTTTGTGCTG^ 

I FLS'KFFQTRHKYNYRP 
FSSLNSSKHDTS I I IDQ 
FPL. ILPNTTQV.L.T 

AGATTGATTCTTCTTATGCACCGATT 
icTAACTAAGAAGAATACGT^ 
RL I LLMHRFSLPFPLCY 
D.FFLCTDSHFPSLCV 
KIDSSYAPILTSLPSVL 

TGGTTATCGTTGTTACTGATGGTTGCTTA^ 

ACCAATAGCAACAATGACTACCA^ 

GYRCY .WLLNSWGSAW 
MV I VVTDGCLTHGVAPG 
WLSLLLMVA. LMG.. RLG 
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jPstl 

Sal I

ACTAGGCAACTGGACGTCCAGCTG 
V I R P A G R 

S V D L Q V D 
DPLTCRST 
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! Hind III 



^ 5 
\TAAG 



TC ACTGGT ACGGGGCCCC^ 

AGTGACCATGCCCCGGGG^ 
SLVRGPPRGRRYR . AL I 
HWYGAPLEVDG I DKL 
LTGTGPPSRSTVS I SFD 

TTGATCTCTCTTCTTCAATCTCTCjCT 
AACTAGAGAGAAGAAGTTAGAGAGAGAGAGA 

SSLNLSLSLSLSLSLY 
SLLS I SLSLSLSLSLCM 
LFSQSLSLSLSLSLSVC 

TCTTTAAATATGGTTGTAATGCTGA^ 
AGAAATTTATACCAACATTACGAC^ 

VFKYGCNAEL LCLSWPN 
SLNMVVMLNCYVYLGQT 
L IWL.C. IAMFILAK 

TGTGTCCATCTTTGAGCAGATAAATCTG^ 
ACCCAGGTAGAAAclcGTCTATTTAGA 
CVHL ADKSGDNVLFTE 
VSIFEQINLAIMFFLL 
LCPSLSR. 1WR.CSFY. 

Pst I

aagcactgca'ggatgagggcctg^ 
ttcgIgacgIcctactcccggac^ 

stag gpe i tsdahwv 
kalqdeglkshrtptgs 

KHCRMRA NH I GRPLGH 

Nco I

TGATGATATGGACTCCTCCACAGCGAGC^ 
ACTACTATACCTGAGGAGGTGTCGCTCGTC^ 

MM IWTPPQRAAMGCE I H 
YGL LHSEQPWDVRST 
DDMDSSTASSHGM. DP 
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ATAGCAGCG|AGATAAGGGAAGCCCGCA^ 
TATCGTCGCATCTATTCCCTTCGG^ 
IAA. IREARNTRLLLFQ 
QRR GKPATLGCCCS 
HSSVDKGSPQH.AVVVP 

CATTTCTAGCTTTCCAGTCCG 

RSKGQATVT I DFFEH 
SKDRKVRRQ.RSTFSSM 
VKIERSGDSDDRLFRA. 

ATGACAACGACGACCTGCTCCTGCAAT 

tactgttgctgctggacgaggacgIta 

DDNDDLLLQYPSPTVEW 
MTTTTCSCNIRPLP.SG 
QRRPAPA I SVPYRRV 

GAATAAATGGGTTTGTAGTTGCACTA^ 

CTTATTTACCCAAACATCAACGTGA^ 

E.MGL.LHYFSQELIES 
NKWVCSCT I SRRN . LK 
G I NGFVVALFLAG I N . K 
CCCTGCAAATTGCTGTTT^ 
GGGACGTTTAACGACAAAGAGAAAGGAAT 

PANCCFSFL I LNLPPV 
ALQI AVSLSLY. TFLLL 
PCKLLFLFPY I KPSSCY 

BamH I Bgl II

CATTAAAAT|GCATGTTAAGACATTTCTG^ 

GTAATTTTAACGTACAATTCTGT^ G G CTTGTACTCTAG 

TLKLHVKTFLYGSEHE I 
H NCMLRHFCMDPNMRS 
IKIAC.DISVWIRT.D 



FIG. 18A-2 
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tatcattgaagtaatgggtagga^ 
atagtaactIcattacccatccta 
yh . sng. dlhyhhhhhl 

IIEVMGRIYIIIIIII 
LSLK .WVGFTLSSSSSS 

Nco I

c'catgggtttggatctaat^ 
ggtacccaaacctagattaatc^ 

HGFGSN.TENLI.NPT 
SMGLDL I RPKTSFK I QP 
PWVWI LDRKPHLKSNP 

CAATATTGGCTTGACTTGCTCCATC^ 

GTTATAACCGAACTGAACGAGGTAGAGGTTCTTTTTATGTTGTTCTTGTT 
P | LA. LAPSPRKIQQEQ 
QYWLDLLHLQEKYNKNN 
N IGLTCS I SKKNTTRT 

CAAAAATTTAGGATGCACATTGAATT 

gtttItaaatcctacgtgtaacttaacta 
qkfrmh i el iwsl enh 

knlgctln . fghyeri 
tki . d a h . i dlvtmres 
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|GGA|TAAAAATATTAAAAT^ 
ACCTAATTTTTATAATTTTATTTTTTA^ 

GLKILK.KINHNHLLT 
MD.KY.NKK. I I I IYSL 
WIKNIKIKNKS.SSTHS 

ATTGCTAAGTGTAAGATAGGTGGTTT 

LTiHILSTKFDIGF LI 
RFTFYPPNLTSASN F 
NDSHSIHQI.HRLLIN 

TGTTTTTTTTCCTTGTTTTTTTTGTGT^ 
SYIRF.KISPFDR. INI 
HILGSKKSLPLTDE I 
F I Y . VLKNLSL .QMNKY 

jTCTjTTAATTCGTTAGGGAAGGATCTAAT^ 
AAGAAAATTAAGCAATCCCTTCCTAG^ 

SFNSLGKDLI Y I Y I Y 

FLLIR.GRI .YN1YIYI 
FF.FVREGSNIIYIYIY 

ATAAATAAATAATCTAAGATTGGTAAAGAGAG 

I F I Y . ILTISLTRI . ID 
YLFIRF.PFLSQEYEST 
IYLLDSIMHFSHPNMNR 



G H I CKNPP IVHSKRSLN 
A I SAKTHQLFTVNAH 
RPYLQKPTNCSQ TL I E 



SEQA 
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TTAAGGTCGAAATTACTTTTAAAT^ 
AATTCCAGCTTTAATGAAAATTTAAAGAT 

GRNYF. ISRDFQ.Nf 
IKVE I TFKFLE ISNKIY 
LRSK L L LNF RFP I KYT 

TCGTATCTTTTACAGTGATGATGCTCCGG^ 
AGCATAGAAAATGTCACTACTACG 

LVSFTVMMLRM I RWKDA 
SYLLQ. . C S G . .DGRMR 
Rl FYSDDAPDDKMEGC 

TGTGTCAGCCGCCTGCGATCTCTGTGGC^ 
ACACAGTCGGCGGACGCTAGAGACACCGC^ 
CVSR LRS LWRGRDEDKD 
VSAACDLCGGDETKTR 
CCQPPA I SVAGTRRRQG 

CGTGAGCGGACGATACCAAGTCTTCTCCTCCCCCACCACGCACGTCTCAG 


GAACTCGCCTGCTATGGTTCAGAAGAGGAGGGGGTGGTGCGTGCAGAGTC 

VSGRYQVFSSPTTHVS 
T ADDTKSSPPPPRTSQ 
RERT I PSLLLPHHARLR 

AT TCCCGAT ACGGCCJ AT CCCGGTG^ 
TAAGGGCTATGCCGGATAGGGCCAC^ 

DSRYGL SRWRVDCTDER 
I PDTAYPGGVWTAQTNE 
FP I RP I PVACGLHRRT 

GTAAATGCCCATCCCCCCTCTTTCATT 
CATTTACGGGTAGGGGGGAGAA^ 
VNAHPPSF I LSLCVCER 
MP I PPLSFFLFACVR 
SKCPSPLFHSFSLRV.E 



FIG. 18B-2 
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gagcgcctataaataagca^ 
ctcgcggatatttaItcgtgcttt^ 

sayk arnkplfsprt 
gap i nkhetspfslqeh 
erl. i stk qapflsknt 

ACCACACCATTCACACAC^ 
TGGTGTGGTAAGTGTGTGATGTA^ 

HHT I HTLHPLLLRAFSP 
TTPFTHY I LCFFEPFRL 
PHHSHTTSSASSSLFA 

Sal I

tccttcctcgtctaaccatg'tcgacct 
aggaaggagcagattggtacagct^ 
sflv pcrpaatatalt 

psssnhvdlrqlrlr 
flprltmstcgncdcvd 

AAGAGCCAGTGCGTGTAAGTCATCCTCCA^ 

Itctcggtcacgcacattcagtagg^ 

rasacksss i ppplll 
qepvrvshppslhlfff 
ksqcv.vilhpstssss 



FIG. 18B-3 
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j Sal I 

jTCTTCTTCTTCTTCTTCT^ 

AAGAAGAAGAAGAAGAAGATTGGAGCGGGGCAAACACAAACTACTCAGCT 
LLLLLLLTSPRLCLMSR 
FFFFFF . PRPVCV . V D 

SSSSSSNLAPFVFDES 

SEQB - 

ACTCTTCCCACATCGCTCGTCAAAACTCA|GAGCTTTATTAGGGAACTCAG 
TGAGAAGGGTGTAGCGAGCAGTTTTGAGTCTCGAAATAATCCCTTGAGTC 

LFPHRSSKLRALLGN I S 
SSH IARQNSELY.GTS 
TLPTSLVKTQSF I REHQ 

^AATACT ATATGT AT ATGTAN AAGG^ 

gttatgatatacatatacatnttccagttgcaaccgacttcttgaaccaa 

nt i c i c7rstlaeelg 
a i lyvyv7gqrwlknlv 
qyymym7kvnvg. rtwf 

ttgcctttgcaggaagaaaggaaacagct 
aacggaaacgtccttcttt^ 

fafagrketatvs i llr 
lplqeerkqlryryc.d 
clcrkkgnsyg i d i v e 

ccgagaagaggtactgatt^ 
ggctcttctccatgactaatcgaagaa 
prrgtd. lllppprrg. 
reevl isffslllved 
tekry . lasspssssrm 

ajcaaactaattaggattaca 
tagtttgattaatcctaatgtggaata^ 

sn lglhl i tlpnafs 
dqtn dytllpylmlfp 
ikliritpyylt.cffr 



FIG. 18C-1 
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E^^A 



Sal I

TATTCGTTTCGTCTCTTCAGC^ 
ATAAGCAAAGCAGAGAAGTCGA^ 

VFVSSLQLRRRGDRCRR 
YSFRLFSYVDEV ! V A A E 
I RFVSSATSTR SLPQ 



AGCTGCCGAGCATGACGGCAAGTGCAAGTGCGGCGCCGCCTGCGCCTGCA 
TCGACGGCTCGTACTGCCGTTCACGTTCACGCCGCGGCGGACGCGGACGT 

SCRA RQVQVRRRLRLH 
AAEHDGKCKCGAACAC 
KLPSMTASASAAPPAPA 

CCGACTGCAAGTGTGGCAAC lTGAl GAAGCACTTGTGTCACTACCACTAAAA 

ggctgacgttcacaccgttgactcttcgtgaacacagtgatggtgatttt 

rlqvwqlrstcvtttk 
tdckcgn . ealvslpln 
ptasvatekhlchyh. i 

aaaagtttgcaatgcataaaaaacaaaa^ 
ItttcaaacgttacgtattItttgIttt 

kfamhkkqknkkkkgr 
kslqc i knkrtkkkkee 
kvcna ktkeqkkkrk 

agaagaaggtgtggctatgtactctaataattcgggcaggctgataagtt 


tcttcttccacaccgatacatgagattattaagcccgtccgactattcaa 
rrrcgyvl . fgqadrl 
eegvamysnnsgrl i g 
kkkvwlctli irag. .v 

gtaagatgggataacgcagtatcatctgtgttatctctgtcctgtgttac 

CATTCTACCCTATTGCGTCATAGTAGACACAATAGAGACAGGACACAATG 

DG I TQYHLCYLCPVL 
CKMG.RSI ICVISVLCY 
VRWDNAVSSVLSLSCVT 



FIG. 18C-2 



Appln. Filing Date: Herewith 

Title: DNA REGULATORY ELEMENTS ASSOCIATED 

WITH FRUIT DEVELOPMENT

Inventor(s): Gregory D. MAY et al.

Application serial No: CIP of 09/160,351 SHEET 83 of 94 



AACTCTCCTATCTATCCTAGTCAATGAAATA^ 
TTGAGAGGATAGATAGGATCAGTTACTTTAT 

QLSYLS . SMKYY . Y SG 
NSPIYPSQ.Ni ISINLV 
TLLSILVNEILLVLLW 

TGTGTCATTCATATATGCTGCTGCTGCTGC 

acacagtaagtataiacgacgacgacgac^ 
cv i h i ccccccflfhqs 

vsf i yaaaaaassftn 
lchsymllllllplsp i 

aacccaaaggatcgattgcactgtaagg^ 
ItgggtttcctagctaacgtgacaItc^ 

TQR I DCTVRPNFLTDM 
QPKGSIAL.GPTSSP1C 
NPKDRLHCKAQLPHRYA 

SEQD 

tcgctcagttacgatgaatgaacagcaaccaaacgagtctgc 

I m i 1 i i i i [ i i i i | i i i i 1 i ill | i i i i [ i i i i | i i i i 1 i-L — 2392 

agcgagtcaatgctacttactt[gtcgttggtttgctcagacg| 
laqlr mnsnqtsl 

SLSYDE . TATKRVC 
RSVTMNEQQPNESA 
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Apa I 



Xho I



Sal I
Acc I 
i Hinc II 



Cla I



'Hind III 



TCACTGGTACGGGGCCCCCCTCGAGGTCGACGGTATCGATAAGCTTTGAT 



AGTGACCATGCCCCGGGGGGAGCCCCAGCTGCCATAGCTATTCGAAACTA 
SLVRGPPRGRRYR AL I 

HWYGAP L EVDG I DKL 
XTGTGPPSRSTVS I SFD 

CTCTTCTCTCAATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTGTATG 



GAGAAGAGAGTTAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGAGACATAC 

SSLNLSLSLSLSLSLY 
SLLS I SLSLSLSLSLCM 
LFSQSLSLSLSLSLSVC 

CTTT AAAT ATGGTTGT AATGCTGAA^ 

GAAATTTATACCAACATTACGACTTAACGATACAAATAGAACCGGGTTTG 
XFKYGCNAEL LCLSWPN 
SLNMVVMLNCYVYLGQT 
L.IWL.C.IAMFILAK 

TGTGTCCATCTTTGAGCAGATAAATCT 

ACACAGGTAGAAACTCGTCTATTTAGACCGCTATTACAAGAAAAATGACT 
CVHL .ADKSGDNVLFTE 



VSIFEQINLA 
LCPSLSR. IWR 



M F F L L 
C S F Y 



AAGCACTGCAGGATGAGGGCCTGAAATCACATCGGACGCCCACTGGGTCA 



TTCGtGACGtCCTACTCCCGGACTtTAGTGTAGCCTGCGGGTGACCCAGT 

STAG GPE ! TSDAHWV 

KALQDEGLKSHRTPTGS 
KHCRMRA . NH I GRPLGH 

TGATGATATGGACTCCTCCACAGCGAG^ 

ACTACTATACCTGAGGAGGTGTCGCTCGTCGGTACCCTACACTCTAGGTG 
MM IWTPPQRAAMGCE I H 
YGL LHSEQPWDVRST 
DDMDSSTASSHGM DP 



f GTAGCCT 



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AT AGCAGCGT AGAT AAGGGAAGCCCGCAACACT AGGCTGTTGTTGTTCCA 


TATCGTCGCATCTATTCCCTTCGGGCGTTGTGATCCGACAACAACAAGGT 
X A A . IREARNTRLLLFQ 
QRR GKPATLGCCCS 
HSSVDKGSPQH AVVVP 

GTAAAGATCGAAAGGT^ 
CATTTCTAGCTTTCCAGTCCGCTGT^^ 

RSKGQATVT I DFFEH 
SKDRKVRRQ RSTFSSM 
VK I ERSGDSDDRLFRA 

ATGACAACGACGACCTGCTCCTGCAATA^ 
TACTGTTGCTGCTGGACGAGGACGTTAT^ 

DDNDDLLLQYPSPTVEW 
MTTTTCSCN I RPLP SG 

qrrpapa i svpyrrv 
gaataaatgggtttgtagttgcacta^ 
ctaaIttacccaaacatcaacgtgat^ 
e mgl lhyfsqel i es 
nkwvcsctisrrn.lk 

GINGFVVALFLAGiN.K 
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CCCTGCAAATTGCTGTTTCTCTT^ 
GGGACGTTTAACGACAAAG^ 

PANCCFSFL I LNLPPV 
ALQIAVSLSLY. TFLLL 
PCKLLFLFPY I KPSSCY 

j BamH I 

CATTAAAAT|GCATGTTAAGACATTTCT 

GTAATTTTAACGTACAATTCTGTAAAGACATACCTAGGCTTGTACTCTAG 
TLKLHVKTFLYGSEHE I 
H NCMLRHFCMDPNMRS 
IKIAC.DISVWIRT.D 

T ATCATTGAAGT AATGGGT AGGAT TTACATTATCATCATCATCATCATCT 
ATAGTAACTTCATTACCCATCCTAAATGTAATAGTAGTAGTAGTAGTAGA 

YH SNG.DLHYHHHHHL 
I IEVMGRIYI I I I I I I 
LSLK .WVGFTLSSSSSS 

jBstXI 

ccatgggt'ttggatctaattagaccgaa^ 
ggtacccaaacctagattaatctggc^ 

hgfgsn tenl i n p t 

SMGLDL I RPKTSFK IQP 
PWVWI LDRKPHLKSNP 
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XXAT AT TGGCT TGACTTGC^ 
XXTATAACCGAACTGAACGAGG^ 

XI LA. LAPSPRKIQQEQ 
XYWLDL LHLQEKYNKNN 
N IGLTCS i SKKNTTRT 

CAAAAATTTAGGATGCACATTGAATTGATTTGGTCACTATGAGAGAATCA 


GTTTTTAAATCCTACGTGTAACTTAACTAAACCAGTGATACTCTCTTAGT 
QKFRMH I EL IWSL ENH 
KNLGCTLN . FGHYER I 
TKI . D A H . IDLVTMRES 

TGGA|TAAAAATATTAAAATAAAAAATAAAT 

ACCTAATTTTTATAATTTTATTTTTTATTTAGTATTAGTAGATGAGTGAG 
GLKILK.KINHNHLLT 
D.KY.NKK. I I I IYSL 
WIKNIK1KNKS.SSTHS 

TAACGATTCACATTCTATC^ 

ATTGCTAAGTGTAAGATAGGTGGTTTAAACTGTAGCCGAAGATTAATTAA 
LTIHILSTKFDIGF.LI 
RFTFYPPNLTSASN F 
NDSHSIHQI.HRLLIN 

TCATATATTAGGTTCTAAAAAATCTC^ 

AGTATATAATCCAAGATTTTTTAGAGAGGGAAACTGTCTACTTATTTATA 
SYIRF.KISPFDR. INI 
HILGSKKSLPLTDE. I 
F I Y VLKNLSL QMNKY 

TTCTTTTAATTCGTTAGGGAAGGATCTAATATAATATATATATATATATA 
AAGAAAATTAAGCAATCCCTTCCTAGATTATATTATATATATATATATAT 

SFNSLGKDL I Y I Y I Y 

FLLIR.GRI .YNIYIYI 
FF.FVREGSNIIYIY1Y 
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TATTTATTTATTAGATTCTA^ 

ATAAATAAATAATCTAAGATTGGTAAAGAGAGTGGTCTTATACTTAGCTG 
I F I Y . ILTISLTRI . ID 
YLF I RF PFLSPEYEST 
IYLLDSNHFSHQNMNR 

MTZ SEQ A

GGCCATATCTGCAAAAACCCAC^ 
CCGGTATAGACGTTTTTGGGTGGTTAACAAG^ 
GH I CKNPP I VHSKRSLN 
A I SAKTHQLFTVNAH 
RPYLQKPTNCSQ. TL I E 

Xba I

ttaaggtcgaaattacttttaaattt'ct^ 
aattccagcIttaatgaaaattta^ 

grnyf. isrdfq.ni 
ikveitfkfleisnkfy 

LRSKLLLNF.RFPIKYT 
TCGTATCTTTTACAGTGATGATGCT^ 
AGCATAGAAAATGTCACTACTACGAGGCCTAC^ 

LVSFTVMMLRMI RWKDA 
SYLLQ. . C S G . .DGRMR 
R I FYSDDAPDDKMEGC 

TGTGTCAGCCGCCTGCGATCTCTGTGGCG^ 

ACACAGTCGGCGGACGCTAGAGACACCGCCCCTGCTCTGCTTCTGTTCCT 
CVSRLRSLWRGRDEDKD 
VSAACDLCGGDETKTR 
VCQPPA I SVAGTRRRQG 

CGTGAGCGGACGATACCAAGTCTTCT 

GCACTCGCCTGCTATGGTTCAGAAAAGGAGGGGGTGGTGCGTGCAGAGTC 

V SGRYQVFSSPTTHVS 
T ADDTKSSPPPPRTSQ 
R ERT I PSLLLPHHARLR 
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AT TCCCGAT ACGGCCT ATCCC^ 
TAAGGGCTATGCCGGATAGGGCCACC^ 

DSRYGL SRWRVDCTDER 
I pDTAYPGGVWTAQTNE 
FP I RP I PVACGLHRRT 

GTAAATGCCCATCCCC^ 

cattIacgggtaggggggagaaagtaagaaa^ 

VNAHPPSF I LSLCVCER 
MP I PPLSFFLFACVR 

skcpsplfhsfslrv.e 

gagcgcctataaataagcacgaaacaa^ 
ctcgcggatatttaItcgtgctttgttcg^ 

sayk.arnkplfsprt 

GAP I NKHETSPF SLQEH 
ERL. iSTKQAPFLSKNT 

ACCACACCATTCACACACTACATCCTCT^ 

tggtgtggtaagtgtgtgatgtaggaga^ 

HHT I HTLHPLLLRAFSP 
TTPFTHY I LCFFEPFRL 
PHHSHTTSSASSSLFA 
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| Sal I 
! ! Acc I 

; j j Hind II Hind II j 

XCCTTCCTCGTCTAACCATGTC^ 
XGGAAGGAGCAGATTGGTACAGCTGG 
SFLV . PCRPAATATALT 
PSSSNHVDLRQLRLR. 
XLPRLTMSTCGNCDCVD 



j INTRON 

AAGAGCCAGjGCGTGTAAGTCATCCTCCA^ 
TTCTCGGTCACGCACATTCAGT^ 

R ASACKSSSIPPPLLL 
QEPVRVSHPPSLHL FFF 
KSQCV.VILHPSSSSSS 



Hind II ! 
Acc I | 
Sal I j ! 

jTCTjCTTCTTCTTCTTCTA^ 
AAGAAGAAGAAGAAGAAGATTGGAGC^ 

LLLLLLLTSPRLCLMSR 
FFFFFF.PRPVCV. . V D 
SSSSSSNLAPFVFDES 



MTZ SEQ B 

CTCTjCCCACATCGCTCG^ 
GAGAAGGGTGTAGCGAGCAGTTTTGAGTCTC^ 
LFPHRSSKLRALLGNIS 
SSH I ARQNSELY . GTS 
TLPTSLVKTQSF I REHQ 
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j Hinc II 

caatactatatgtatatgta^ 
gttatgatatacatatacaInttccag^ 

nt i c i c7rstlaeelg 
a i lyvyv7gqrwlknlv 
gyymym7kv nvg rtwf 

INTRON | MT2 Bam/ MT2 SEQ B

ttgcctttgcaggaagaanggaaaca^ 
aacggaaacgtccttcttncctH 

FAFAGR7ETATVS I LL? 
LPLQEE?KQL?Y?YC. ? 
CLCRK7GNSY? I ? I V? 

CCGAAAANAGGTACTGATTANCTTCTT 

GGCTTTTNTCCATGACTAATNGAAGAAGAGGGAGGAGGAGCAGCTNCTAC 
PK?GTD?LLLPPPRR? 
R K ? V L I ?FFSLLLV?D 
TE7RY L?SSPSSSS?M 

ATCAAACTAATTAGGATTACNCCTTATTAC 
I i i i I m i i I i i i i I i i i i 1 i i i i | i i i i I - 1880 
TAGTTTGATTAATCCTAATGNGGAATAATG 

SN.LGL7LIT 
DQTN DY7LL 
\ K L I R I T P Y Y 
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